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^ (57) Abstract: A compound comprising a taiget cell-specific portion and a cytotoxic portion characterised in that the target cell-spe- 
^ cific portion comprises a humanised monoclona] antibody having specificity for polymoiphic epithelial mucin (PEM), or an antigen 
binding figment thereof, and the cjrtotoxic portion has endonucleolytic activity. Preferably, the taiget cell-specific portion com- 
^ prises a humanised HMFG-1 andbody or an antigen binding fii'agment thereof. Advantageously, the cytotoxic portion is at least the 
^ catalyticaUy active portion of a DNA endonndease, e.g. a human DNA endonudease L The invention further provides nucleic adds 
1^ encoding the compouRds of the invention, and the use of such compounds in medidne, e.g. in the treatment of cancer. 
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COMPOUNDS FOR TARGETING 

The present invemion relates to cytotoxic confounds fliat have a high 
5 avidity for, and can he targeted to, selected cells. Specifically, the 
invention provides compounds conq>rising a cytotoxic portion having 
DNA endonacleolytic activity and a target-cell specific portion having 
specificity for human polymoiphic epifliehal mucin (PEM). 

' 10 Backgroimd 



The cell-specific targeting of con5)om3ds that are direcdy, or indirectly, 
cytotoxic has been proposed as a way to combat diseases such as cancer. 
Bagshawe and his co-woricers have disclosed (Bagshawe (1987) Br. J. 

15 Cancer 56, 531; Bagshawe et ed (1988) Br. J. Cancer 58, 700; WO 
88/07378) conjugated conrpounds comprising an antibody or part thereof 
and an enzyme, the antibody being q)ecific to tumour cell antigens and the 
enzyme acting to convert an mnocaons pro-drag ruto a qtotoxic 
compound. The cytotoxic compounds were alkylating agents, e.g. a 

20 benzoic acid mustard released fi:om ;7ara-N-bis(2- 
d3laroefliyl)aniinobenzoyl ghitamic acid by the action of Pseudomonas sp. 
C3PG2 enzyme. 

An alternative system using different pro-drugs has been disclosed 
25 (WO 91/11201) by Epenetos and co-workers. The cytotoxic compounds 
were cyanogenic monosaccharides or disaccharides, such as the plant 
compound amygdalm, which release cyanide upon the action of a p- 
ghicosidase and hydroxynitrile lyase. 
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In a further alternative system, the use of antibody-enzyme conjugates 
containing the enzyme alkaline phosphatase in conjunction with the pro- 
drug etoposide 4'-phosphate or 7-(2'-aminoethyl phosphate)mitomycin or 
5 a combination thereof have been disclosed (EP 0 302 473; Senter et al 
(1988) Proc. Natl Acad. ScL USA 85, 4842). 

Rybak and co-workers have disclosed (Rybak et al (1991) J. Biol. Chem. 
266, 21202; WO 91/16069) the cytotoxic potential of a monomeric 
10 pancreatic ribonuclease when injected directly into Xenopus oocytes and 
the cytotoxic potential of monomeric RNase coi5)led to human transferrin 
or antibodies directed against the transferrin receptor. The monomeric 
RNase hybrid proteins were cjrtotoxic to human er3^oleukaemia cells in 
vitro. 

15 

Other approaches are the in vivo application of streptavidin conjugated 
antibodies followed, after an appropriate period, by radioactive biotin 
(Hnatowich et al (1988) J. Nucl. Med. 29, 1428-1434), or injection of a 
biotihylated mAb followed by radioactive streptavidin (Paganelli et al 
20 (1990) Int. J. Cancer 45, 1184-1189). A pilot radiounmunolocalisation 
study in non-small cell lung carcinomas was conducted with encouraging 
results (Kalofonos et al (1990) J. Nucl Med. 31, 179M796). 

Apart from these exanq}les, it is rather more common to see biotmylated 
25 antibodies and streptavidin-enzyme conjugates, which are used in enzyme- 
linked immxmosorbent assays. 

These previous systems have used relatively large antibody-enzyme. 
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antibody-streptavidin or antibody-biotm conjugates md may comprise 
portions of non-mammalian origin which are highly immimoreactive. 

We have now devised improved con5)omids for targeting cells to be 
5 destroyed. 

Stonmary of Invention 

A first aspect of fhe invention provides a compoimd conqirising a target 
10 cell-specific portion and a cytotoxic portion characterised in that the target 
cell-specific portion comprises an hmnanised monoclonal antibody having 
specificity for polymorphic epithelial mucm (PEM), or an antigen bmdmg 
fi:agment thereof, and the cytotoxic portion has endonucleolytic activity. 

15 By "target cell specific" portion we mean the portion of the compound 
which comprises one or more bmding sites which recognise and bmd to 
polymoipWcepifheM mucin OPEM) on the target cell. Upon contact with 
^ the target cell, the target cell specific portion is preferably internalised 
along with fhe cytotoxic portion. Such intemalisation results in fhe 

20 cytotoxic portion bemg delivered to the cell cytosol, where it has access 
to the cell's nucleic acid molecules. 

The target cell-specific portion of fhe compounds of fhe invention 
comprises an humanised monoclonal antibody having specificity for 
25 polymorphic epithelial mucm (PEM), or an antigen bmding firagment 
thereof. 

Potymoiphic epiSielial mucin, or PHM, is a corrqponent of fhe human milk 



I 
r 
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fat globule. PEM is expressed by cells in several body tissues £uid is also 
found in urine. Significantly, PEM is known to be expressed in epithelial 
cancer cells, notably in ovarian, gastric, colorectal and pancreatic cancer 
cells. 

5 

Monoclonal antibodies which will bind to PEM are ahready known, but in 
any case, with today's techniques in relation to monoclonal antibody 
technology, antibodies can be prepared to most antigens* The antigen- 
specific portion may be a whole antibody, a part of an antibody (for 

10 example a Fab or F(ab')2 fragment), a synthetic antibody firagment (for 
example a single chain Fv fragment [ScFv]), or a peptide/peptidomimetic 
or similar. Suitable monoclonal antibodies to selected antigens may be 
prepared by known techniques, for example those disclosed in 
"Monoclonal Antibodies: A manual of techniques'', H Zola (CRC Press, 

15 1988) and in "Monoclonal Hybridoma Antibodies: Techniques and 
Applications", J G R Hurrell (CRC Press, 1982) and Antibody 
Engineering, A Practical Approachy McCafferty, J. et al, ed. (JKL Pres, 
.1996). 

20 By 'humanised monoclonal antibody' we include monoclonal antibodies 
having at least one chain wherein the framework regions are 
predominantly derived from a first, acceptor monoclonal antibody of 
human origin and at least one complementarity-Kietermining region (CDR) 
is derived from a second, donor monoclonal antibody having specificity 

25 for PEM. The donor monoclonal antibody may be of human or non- 
human origm, for exanq)le it may be a murine monoclonal antibody. 

Preferably, both diains of the huuMnised monoclonal antibody comprise 
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CDRs grafted from a donor monoclonal antibody having specificity for 
PEM. 

Advantageously, the CDR-grafted Cue. humanised) chain comprises two 
5 or all teee CDRs derived from a donor antibody having specificity for 
PEM. 

Convenienfly, the humanised monoclonal antibody comprises only human 
framework residues and CDRs from a donor antibody having specificity 
10 for PEM. 

However , it will be appreciated by those skilled in the art that m order to 
maintain and optmiise the specificity of the humanised antibody it may be 
necessary to alter one or more residues in the framework regions such that 
15 they correspond to equivalent residues in the donor antibody. 

Conveniently, the framework regions of the humanised antibody are 
derived from an human IgG monoclonal antibody. 

20 Methods of Tnalring humauised monoclonal antibodies are weE-known in 
the art, for example see Jones et aL (1986) Nature 321:522-525, 
Riechmann et al. (1988) Nature 332:323-327, Verhoeyen et cd. (1988) 
Science 239:1534-1536 and EP 239 400 (to Wmter). 

25 In a preferred embodiment of the first aspect of the invention, the target 
cell-specific portion comprises an humanised HMFG-1 monoclonal 
antibody or an antigen binding fragment thereof. 
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HMFG antibodies are raised against human milk fat globule (EIMFG), in 
a del5)idated state (see Taylor-Papadimiriou et al., 1981, /nr. J. Cancer 
28:17-21 and Gendler et al, 1988, /. Biol Chem. 236:1282-12823). 
HMFG-1 monoclonal antibodies bind to a particular component of 
5 HMFG, namely polymorphic epithelial mucin (PEM). Binding is thought 
to involve the amino acid sequence APDTR within the twenty amino acid . 
tandem repeats of the muc-l gene product. 

Exemplary hamanised HMFG-1 antibodies are disclosed in WO 92/04380. 

10 

Advantageously, the target cell-specific portion is an humanised HMFG-1 
monoclonal antibody. 

In a preferred embodiment of the first aspect of the invention, the target 
15 cell-specific portion comprises a fragment of an humanised monoclonal 
antibody having specificity for polymorphic epithelial mucin (PEM), said 
fragment retaining the antigen binding properties of the parent antibody. 

The variable heavy (Vh) and variable light (V^) domains of the antibody 
20 - are involved in antigen recognition, a fact first recognised by early 

proteaise digestion experiments. Further confirmation was found by 

"humanisation" of rodent antibodies. Variable domains of rodent origin 
' may be fused to constant domains of human origin such that the resultant 

antibody retains the antigenic specificity of the rodent parented antibody 
25 \Morrison et al (1984) Proc. Natl Acad, Scu USA 81, 6851-6855). 

That antigenic specificity is conferred by variable domains and is 
indq)endent of the constant domaios is known firom experiments involving 
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the bacterial expression of antibody fragments, all containing one or more 
variable domains. These molecules include Fab-like molecules (Better et 
al (1988) Science 240, 1041); Fv molecules (Skeiia et cd (1988) Science 
240, 1038); disulphide-linked Fv molecules (Young et aZ., 1995, FEBS 
5 Lett, 377:135-139); single-chain Fv (ScFv) molecules where the Vh and 
Vl partner domains are linked via a flexible oligq)eptide (Bird et al (1988) 
Science 242, 423; Huston et al (1988) Proc. Natl Acad. Sci. USA 85, 
5879) and single domain antibodies (dAbs) comprising isolated V domains 
(Ward et al (1989) Nature 341, 544). A general review of the techniques 
10 involved in the synthesis of antibody fragments which retain then: specific 
bindmg sites is to be found in Wmter & Milstein (1991) Nature 349, 293- 
299. 

By "ScFv molecules" we mean molecules wherein the Vh and Vl partner 
15 domains are linked via a flexible oligopeptide. 

Caiimaeric antibodies are discxissed by Neuberger et al (1988, 8th 
1 International Biotechnology Symposium Part 2, 792-799). 

20 The advantages of using antibody fragments, rather than whole antibodies, 
are several-fold. The smaller size of the fragments allows for rapid 
clearance, and may lead to improved tumour to nonrtmnour ratios. Fab, 
Fv, ScFv, disulphide Fv and dAb antibody fragments can aH be e:q)ressed 
in and secreted from bacteria, such as E. coli, or eukaryotic expression 

25 systems such as Yeast or mammalian systems, thus allowing the facile 
production of large amounts of the said fragments. 

Whole antibodies, and F(ab')2 fragments are 'T)ivalenf . By ''bivalent" we 
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mean that the said antibodies and F(ab')2 fragments have two antigen 
combining sites. In contrast, Fab, Fv, ScFv, disulphide Fv and dAb 
fragments are monovalent, having only one antigen combining site. 

5 Preferably, the target ceU-specific portion of the compounds of the 
invention comprises an antigen binding fragment of the humanised 
antibody selected from the group consisting of Fab-like molecules, such 
as Fab and F(ab')2, Fv molecules, disulphide-linked Fv molecules, ScFv 
molecules and single domain antibodies (dAbs). 

10 

More preferably, the target cell-specific portion comprises a Fab molecule 
or a F(ab% molecule. 

Yet more preferably, the target cell-specific portion comprises an amino 
15 acid sequence encoded by at last part of one or both of the nucleotide 
sequences of Figure 3(a) and (d) . 

Most preferably, the target cell-specific portion comprises an amino acid 
sequence encoded by the nucleotide sequence of Figure 3(a) and an amino 
20 add sequence encoded by die nucleotide sequence of Figure 3(d). 

Preferably, the target cell-specific portion recognises the target cell with 
high avidity. 

25 By **hi^ avidity** we mean that the target cell-specific portion recognises 
the target cell with a binding constant of at least = 10"^ M, preferably 
at least = lO'^'M, suitably = lO'^^M, more suitably = 10"" M, 
yet more suitably still = 10*^ M, and more preferably = ICT^^M or 
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eveiiE:d = 10"'^M. 

Preferably, flie target cell-specific portion comprises an antigen binding 
fiagment of an humanised HMFG-1 monoclonal antibody, e.g. an Fab or 
5 F(ab')2 fragment thereof, whereitt a hinge region contains a mutation 
(i.e. wherein the hinge is a variant or hybrid of a naturally occurring 
hmge). More preferably, the variant hinge comprises the ammo acid 
sequence CCVECPPCPAPE. 

10 By ^cytotoxic portion' we mean a portion having endonucleolytic activity 
which is toxic to the cell if it is to reach, and preferably enter said cell. 

In a prefened embodiment of the first aspect of the mvention, Ihe 
cytotoxic portion has DNA endonucleolytic activity. 

15 

Advantageously, the cytotoxic portion is at least the catalytically active 
portion of a DNA endonuclease. 

Examples of known DNA endonucleases mclude bovine DNase I (see 
20 WorraU andConolly, 1990, J. Biol. Chem. 265:21889-21895). Human 
pancreatic DNase I has also been cloned (see Shak et al, 1990, Proc. 
Natl Acad. Sci. USA 87:9188-9192 and Hubbard et al. , 1992, New Eng. 
/. Me(f. 326:812-815). 

25 Preferably, the endonuclease is a mammaliaTi deoxyribonuclease I. 



More preferably, the endonuclease is a human deoxyribonuclease L 
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Most preferably, the cytotoxic portion comprises the aTninn acid sequence 
shown in Figure 2(a) or 2(b). 

Preferably, the cytotoxic portion of the compound of the invention is 
5 capable of oligomerisation, e.g. dimerisation. Attachment of the target- 
cell specific portion to a cytotoxic portion capable of oligomerisation 
provides a method for increasmg the number of bmding sites to the target 
cell. For example, if the target cell-specific portion is jomed to a portion 
capable of forming a dimer then the number of target cell-specific binding 

10 sites is two; if the target cell-specific portion is joined to a portion capable 
of forming a tetramer then the number of target cell-specific bindmg sites 
is four. The number of target cell-specific binding sites is greater than 
one and the con5)ounds may therefore have a greater avidity for the target 
cell than do compounds which only have one target cell-specific bmding 

15 site. 

I It is preferable for the C3^toxic portion of the compound of the iuvention 
capable of ohgomerisation to contain no interchain disulphide bonds nor 
- intrachain disulphide bonds; to be well characterised; to be non-toxic; to 
20 be stable; to be amenable to preparation m a form suitable for pre-clinical 
or clinical use or be in pre-clkucal or clinical use; and for the subunit 
monomers to have a high affini ty for each other, that is they contain one 
or more subunit binding sites. 

25 Advantageously, the cytotoxic portion is of mammalian, preferably 
human, origm. The use of the said mammalian proteins as the cjrtotoxic 
portion of the compound of the invention is advantageous since such 
compounds are less likely to give rise to undesirable immune reactions. 
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It will be appreciated by those skilled in the art that the cjrtotoxic portion 
may be a variant of a naturally occurring endonuclease. 

5 By "a variant" we include cytotoxic portions comprising of a naturally 
occurring endonuclease wherein there have been amino acid insertions, 
deletions or substitutions, either conservative or non-conservative, such 
that the changes do not substantially reduce the endonuclease activity of 
the variant compared to that of the naturally occurring endonuclease. For 
10 exaniple, the variant may have inca:eased activity compared to the naturally 
occurring endonuclease 

Such variants may be made using methods of protem engineering and site- 
directed mutagenesis commonly known in the art (for example, see 
15 Sambrook et al., 1989, Molecular cloning: A Laboratory Manual, 2^ 
edition, Cold Spring Harbor Laboratory Press, NY, USA). 

In an alternative embodiment, the endonuclease is a restriction 
endonuclease, such as a microbial type II restriction endonuclease. 
20 Exemplary type II restriction endoimcleases include BaniHI, HiruER^ 
Mspl, SaiiSAL, Hinfi, Notl and EcoRI. 

In another preferred embodiment of the first aspect of the invention, a 
nuclear localization signal is incorporated into the compound. 

25 

Preferably, the nuclear localization signal (NLS) comprises a nuclear 
localization signal from the SV40 large T antigen (Kalderon et al. , 1984, 
Cell 39:499-509), and specifically the amino acid sequence PKXKRKV. 
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Inclusion of a nuclear localization signal encourages the compound of flie 
invention to gain access to the chromosomal DNA during the periods of 
the cell cycle when the nuclear menibrane is intact, since the nuclear pores 
are permeable to large molecules incorporating said nuclear localization 
5 signal. 

In a further preferred embodiment of the first aspect of the invention, the 
- target cell-specific portion and the cytotoxic portion are fiised to create a 
fusion compound. 

10 

By "fusion compound" we include a compound comprising one or more 
functionally distinct portions, wherein the distinct portions are contained 
within a single polypeptide chain produced by recombinant DNA 
techniques. For example, the compound may comprise a whole antibody 
15 wherein the heavy chain is fused to human DNase I. Alternatively, the 
compound may comprise an Fab or F(ab')2 fragment of an antibody 
whereiQ the truncated heavy chain {Le. the Fd chain) is fused to human 
DNase I. 

20 Preferably, the target-cell specific and the cytotoxic portion of the fusion 
compound of the invention separated by a linker sequence, for example to 
allow greater flexibility of the portions relative to one another. 

More preferably, the linker sequence comprises a GG dipeptide. 

25 

Most preferably the linker sequence is or comprises GG or GSGG. 
Alternatively, the target-cell specific and the cytotoxic portion of the 
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compound of the inventioii are separate moieties linked together by any of 
the conventional ways of cross-linking polypeptides, such as those 
generally described in O^SuUivan et alAml. Biochem. (1979) 100, 100- 
108. For example, flie antibody portion may be enriched with thiol 

5 groups and flie enzyme portion reacted with a bifimctional agent capable 
of reacting wifli those thiol groups, for exan5)le die N-hydroxysuccinimide 
ester of iodoacetic acid (NHIA) or N-succinimidyl-3-(2- 
pyridyldifliio)propionate (SPDP). Amide and thioether bonds, for 
example achieved with m-maleinndobenzoyl-N-hydroxysuccinimide ester, 

10 are generally more stable in vivo than disulphide bonds. 

In a prefened embodiment of the first aspect of the invention, the 
compound comprises all or part of the amino acid sequence as shown in 
Figure 3(c) (i.e. an HMFG-1 light chain) together with all or part of an 
15 amino acid sequence selected from the group consisting of amino acid 
sequences as shown in Figures 5(d), 6(d), 7(b), 8(b), 9(b), 10(b), 11(b), 
12(b), 13(d), 14(d), 15((i), 16(c), 17(d), 18(d) and 19(d) (Le. anHMFG-1 
heavy or Fd chain/DNase fusion). 

20 Advantageously, the con5)ound is a whole HMFG-1 antibody/human 
DNase I fusion conq)ound con5)rising an amino acid sequence as shown 
in Figure 3(c) and an amino acid sequence as shown in Figure 7(b). 
Preferably, the compound is a tetrameric compound con5)rising two 
HMFG-1 li^t chains and two HMFG-1 heavy chain /DNase I fusions. 

25 

Conveniently, the confound comprises an amino acid sequence as shown 
in Figure 3(c) and an amino acid sequence as shown in Figure 14(d). 
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Preferably, the compound comprises one of the pairs of amino acid 
sequences defined above wherein the leader sequence of each amino acid 
(the first 19 amino acids of the sequences shown in each figure) is 
removed. It will be appreciated by persons skilled in the art that the 
5 compounds of the invention may also comprise variants of such amino 
acid sequences. 

Suitably, die confound is a tetrameric confound comprising two HMFG- 
1 light chains and two HMFG-1 Fd chain /DNase I fusions. More 
10 preferably, tiie compound is a dimeric compound comprising one HMFG- 
1 light chain and one HMFG-1 Fd chain /DNase I fusion. 

A second aspect of the invention provides a nucleic acid molecule 
encoding a compound accordhig to the first aspect of the invention, or a 
15 target cell-specific portion or cj^otoxic portion thereof. 

By "nucleic acid molecule" we include DNA, cDNA and mRNA 
molecules. 

20 In a preferred embodiment of the second aspect of the invention, the 
nucleic acid molecule comprises all or part of the nucleotide sequence as 
shown in Figure 3(a or b) (i,e, encoding an HMFG-1 light chain) together 
with all or part of a nucleotide sequence selected from the group 
consisting of nucleotide sequences as shown in Figures 5(a, b and c), 6(a, 

25 b and c), 7(a), 8(a), 9(a), 10(a), 11(a), 12(a), 13(a, b and,c), 14(a, b and 
c), 15(a, b and c), 16(a and b), 17(a, b and c), 18(a, b and c) and 19(a, b 
and c) (Le. encoding an HMFG-1 heavy or Fd chain/DNase fusion). 
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Advantageously, the nucleic acid molecule comprising a nucleotide 
sequence as shoAvn in Figure 3(b) and a nucleotide sequence as shown in 
Figure 7(a). 

5 Conveniently, the compound coniprises a nucleotide sequence as shown in 
Figure 3(b) and a nucleotide sequence as shown m Figure 14(c). 

Alternatively, the nucleic acid molecule comprises nucleotide sequences 
that are degenerate sequences of those nucleotide sequences identified 
10 above (i.e. which encode the same amino acid sequence). 

A further aspect of the present invention provides a method of making a 
compound according to the first aspect of the invention, said method 
comprising expressing one or more nucleic acid molecules accordiug to 
15 the second aspect of the invention in a host cell and isolating the 
compound therefrom. 

It is pr^sraWe that the two portions of the compound of the invention are 
produced as a fusion compound by recombinant DNA techniques, 

20 whereby a length of DNA comprises respective regions encoding the two 
portions of the compound of the invention either adjacent one another or 
separated by a region encodhig a linker peptide wliich does not destroy the 
desired properties of the con5)ound. The benefits in making the 
comypound of the invention using recombinant DNA techniques are several 

25 fold. Firstly, it enables a high degree of precision with which the two 
portions of the compound can be joined together. Secondly, the 
construction of compounds which are "hetero-oligomeric" can be 
controlled by the expression of the difBerent recombinant DNA molecules 
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encoding each of the different type of subnnit of the "hetero-oligomer" in 
the same host cell. 

By "hetero-oligomer" we mean those con:5)onnds in which two or more 
5 different ceD-specific portions are joined to either the same or to different 
subunits which are capable of oligomerisation. The expression, in the 
same host cell of two compounds, of A and B, each with different target 
cell specific portions but with a common second portion capable of 
oligomerisation will result in a mixed population of compounds. For 
10 exanq)le, if the common second portion is capable of dimerisation, three 
potential con5)ounds will be produced: A2, AB and Bj, in a ratio of 1:2:1, 
respectively. 

The separation of the desired compound with each of the different cell' 
15 specific portions, that is AB, can be achieved by two step affinity 
chromatography. 

! Application of the mixture of compounds to an affinity column specific for 
A will result in the binding of A2 and AB, These compounds are eluted 
20 from this first column, and then applied to an affinity column specific for 
B; This will result in AB, but not A2, being bound to the column. 
Finally, the desired product AB, can be eluted. 

Of course, the order in which the affinity columns are used is not 
25 hnportant. 



The same principle of separating those compounds with two or more 
different bmding sites can be applied to the purification of the desired 



I 
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compounds from mixtures of other hetero-oligomers. 



Conceivably, the two portions of the compound may overlap wholly or 
partly. 

5 

Preferably, flie conopound is a multimeric con5)ound such as a whole 
antibody/DNase fusion comprising two light chains and two heavy chains 
(H2L2), a F(ab')2 &sion comprising two light chains and two truncated 
heavy chains (FdjI^, or a Fab fusion comprismg one light cham and one 
10 truncated heavy chain (FdL) . 

The nucleic acid may be expressed in a suitable host to produce a 
polypeptide comprising the compound of the invention. Thus, the nucleic 
acid encoding the compound of the invention or a portion thereof may be 

15 used m accordance with known techniques, appropriately modified in view 
of the teachings contained herein, to construct an e35)ression vector, which 
is then used to transform an 25)propriate host cell for the expression and 
production of the polypeptide of flie invention* Such techniques include 
those disclosed m US Patent Nos. 4,440,859 issued 3 April 1984 to Rutter 

20 et al, 4,530,901 issued 23 July 1985 to Weissman, 4,582,800 issued 15 
April 1986 to Crowl, 4,677,063 issued 30 June 1987 to Mark et al, 
4,678,751 issued 7 July 1987 to Goeddel, 4,704,362 issued 3 November 
1987 to Itakura et oL, 4,710,463 issued 1 December 1987 to Murray, 
4,757,006 issued 12 July 1988 to Toole, Jr. et al 4,766,075 issued 23 

25 August 1988 to Goeddel a/ and 4,810,648 issued 7 March 1989 to 
Stalker, all of which are incorporated herem by reference. 

Where the compound of the invention is multimeric, the constituent chains 
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may be encoded by a single nucleic acid molecule or sqpdiate nucleic acid 
molecule (e^ressed in a common host cell or in different host cells and 
assembled in vitro) . 

5 The nucleic acid encoding the compound of the invention or a portion 
thereof may be jomed to a wide variety of other nucleic acid sequences for 
introduction into an appropriate host. The companion noicleic acid will 
depend \xpon the nature of the host, the manner of the introduction of the 
nucleic acid nito the host, and whether qpisomal maintenance or 
10 integration is desired. 

It will be appreciated that m order to prevent expression of the cytotoxic 
portion of the confound of the invention from killing the host cells in 
which it is expressed, it may be necessary to link (he nucleic acid of the 
15 second aspect of the invention to a signal sequence capable of directing 
secretion of the expressed compound (or portion) out of the host cell. 
Signal sequences will be selected according to the type of host cell used. 
■ Exen^)laiy signal sequences include the ompA signal sequence (for 
example, see Takahara et al.,19S5, J. Biol Qiem. 260(5):2670-2674). 

20 

Generally, the nucleic acid is inserted into an egression vector, such as 
a plasmid, in proper orientation and correct reading frame for expression. 
If necessary, the nucleic acid may be Itoked to the appropriate 
transcriptional and translational regulatory control nucleotide sequences 
25 fecbgmsed''by the desired host, although such controls are generally 
available in die e3q)ression vector. For example, the nucleic acid molecule 
encoding a con5)ound of the invention may be linked to or comprise a 
Kozak consensus ribosome binding sequence (such as GCCGCCACC) to 
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enhance translation. 

The vector is then mtrodnced into the host through standard techniques. 
Generally, not all of the hosts will be transformed by the vector. 
5 Therefore, it will be necessary to select for transformed host cells. One 
selection technique involves incorporating into the egression vector a 
nucleic acid sequence, with any necessary control elements, that codes for 
a selectable trait in tte transformed cell, such as antibiotic resistance. 
Alternatively, the gene for such selectable trait can be on anolher vector, 
10 which is used to co-transform the desired host cell. 

Host cells that have been transformed by (he recombinant nucleic acid of 
the invention are then cultured for a sufficient time and under appropriate 
conditions kQown to those skilled in the art in view of flie teachings 
15 disclosed herein to permit the expression of the polypeptide, which can 
then be recovered. 

Many e:q)ression systems are known, including bacteria (for exanple E. 
coll and Bacillus subtilis), yeasts (for example Saccharomyces cerevisiae 
20 and Pichia pastoris), filamentous fimgi (for example Aspergillus), plant 
cells, animal cells (for tiwsph COS-1, COS-7, CHO, NIH 3T3, NSO and 
BHK cells) and insect cells (for exan5>le Drosophila, SF9 cells). 

Those vectors that include a replicon such as a procaryotic replicon can 
25 also include an appropriate promoter such as a procaryotic promoter 
capable of directing the expression (transcription and translation) of the 
genes in a bacterial host cell, such as E. coli, transformed therewith. 
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A promoter is an expression control element formed by a DNA sequence 
that permits binding of RNA polymerase and transcription to occur. 
Promoter sequences compatible with exemplary bacterial hosts are 
typically provided in plasmid vectors containing convenient restriction 
5 sites for insertion of a DNA segment of the present mvention. 

Typical procaryotic vector plasmids are pUC18, pUC19, pBR322 and 
pBR329 (available from Biorad Laboratories, Ridunond, CA, USA), 
p7rc99A and pKK223-3 (available from Pharmacia Piscataway , NJ, USA) 
10 and the pET system (T7 promoter, Novagen Ltd), 

A typical mammalian cell vector plasmid is pSVL available from 
Pharmacia, Piscataway, . NJ, USA. This vector uses the SV40 late 
promoter to drive expression of cloned genes, the highest level of 
15 expression being found in T antigen-producing cells, sudi as COS-1 cells. 

An example of an inducible mammalian expression vector is pMSG, also 
available from Pharmacia. This vector uses the glucocorticoid-mducible 
promoter of the mouse mammary tumour virus long terminal repeat to 
20 drive expression of the cloned gene, 

Usefrd yeast plasmid vectors are pRS403-406 and pRS413-416 and are 
generally avaflable from Stratagene Cloning Systems, La JoUa, CA 92037, 
USA. Plasmids pRS403, pRS404, pRS405 and pRS406 are Yeast 
25 Integrating plasmids (Yips) and incoiporate the yeast selectable markers 
his3, trpl, leu2 and ura3. Plasmids pRS413-416 are Yeast Centromere 
plasmids (YCps). 
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Further useful vectors for transformation of yeast cells, such as Pichia, 
iudude the 2ii plasmid p YX243 (available from R and D Systems Limited) 
and llxe integrating vector pPICZ series (available from Invitrogen). 

5 A variety of methods have been developed to operatively Iwk DNA to 
vectors via complementary cohesive termini. For instance, 
complementary homopolymer tracts can be added to the DNA segment to 
be inserted to the vector DNA. The vector and DNA segment are then 
joined by hydrogen bonding between the con5)lementary honaopolymeric 

10 tails to form recombinant DNA molecules. 

Synthetic Hnkers containing one or more restriction sites provide an 
alternative method of joining the DNA segment to vectors. The DNA 
segment, generated by endonuclease restriction digestion as described 
15 earlier, is treated with bacteriophage T4 DNA polymerase or E. coli DNA 
polymerase I, enzymes that remove protrudmg, 3'-shigle-stranded termini 
with fheh 3'-5'-exonucleolytic activities, and fill in recessed 3'-ends with 
fheir polymerizing activities. 

20 The combmation of fliese activities therefore generates blunt-ended DNA 
segments. The blunt-ended segments are then mcubated with a large 
molar excess of linker molecules in the presence of an enzyme that is able 
to catalyze the ligation of blunt-ended DNA molecules, such as 
bacteriophage T4 DNA ligase. Thus, the products of the reaction are 

25 DNA segments carrying polymeric Imker sequences at their ends. These 
DNA segments are then cleaved with the appropriate restriction enzyme 
and ligated to an e^qpression vector that has been cleaved with an enzyme 
that produces termini compatible with those of die DNA segment. 
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Synthetic linkers containing a variety of restriction endomiclease sites are 
commercially available from a number of sources including International 
Biotechnologies Inc, New Haven, CN, USA. 

5 

A desirable way to modify the nucleic acid encoding the compound of the 
inventipn or a portion thereof is to use the polymerase chain reaction as 
disclosed by Saiki et al (1988) Science 239, 487-491 . 

10 In this method flie nucleic acid to be enzymatically amplified is flanked by 
two specific oligonucleotide primers which themselves become 
incorporated into the amplified nucleic acid. The said specific prnners 
may contam restriction endonuclease recognition sites which can be used 
for cloning into expression vectors using methods known in the art. 

15 

Exemplary genera of yeast contemplated to be useful in the practice of the 
present invention are Pichia, Saccharomyces, KluyveromyceSy Candida, 
Torulopsis, Hansenula, Schizosaccharomyces, Citeromyces, Pachysolen, 
Debaromyces, Metschunikowia, Rhodosporidium, Leucosporidium, 

20 Botryoascus, Sporidiobolus, Endomycopsis, and the like. Preferred 
genera are those selected from the group consisting of Pichia, 
Saccharomyces, Kluyveromyces, Yarroma and Hansenula. Examples of 
Saccharomyces are Saccharomyces cerevisiae^ Saccharomyces italicus and 
Saccharomyces rouxii. Exanq)les of Kluyveromyces are Kluyveromyces 

25 Jragilis and Kluyveromyces lactis. Examples of Hansemda are Hansenula 
polymorpha, Hansenula anomala and Hansenula capsvlata. Yarrowia 
lipolytica is an example of a suitable Yarrowia species. 
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Methods for the transfonnation of 5. cerevisiae are tau^t generafly in EP 
251 744, EP 258 067 and WO 90/01063, all of which are incorporated 
herem by reference. 

5 Suitable promoters for S. cerevisiae include those associated with the 
PGKl gene, GALl or GALIO genes, CYCl, PH05, TRPl, ADHl, ADH2, 
the genes for glyceraldehyde-3-phosphate dehydrogenase, hexoldnase, 
pymvate decarboxylase, phosphofiuctokinase, triose phosphate isomerase, 
phosphoglucose isomerase, glucokinase, a-mating factor pheromone, a- 
10 mating fector pheromone, the PRBl promoter, the GVT2 promoter, and 
hybrid promoters mvolvmg hybrids of parts of 5' regulatory regions with 
parts of 5' regulatory regions of other promoters or with upstream 
activation sites (e.g. the promoter of EP-A-258 067). 

15 The transaiption tennination signal is preferably the 3' flanking sequence 
of a eukaiyotic gene which contains proper signals for transcription 
termination and polyadaiylation. Suitable 3' flanking sequences may, for 
example, be those of the gene naturally linked to the expression control 
sequence used, i.e. may correspond to the promoter. Alternatively, they 

20 may be different ia which case the termination signal of the S. cerevisiae 
AHDl gene is preferred. 

The present invention also relates to a host cell transformed with a 
polynucleotide vector construct of the present invention. The host cell can 
25 be either procaryotic or eukaryotic. Bacterial cells are preferred 
procaryotic host ceUs and typically are a strain of E, coli such as, for 
example, the E. coU strains DH5 available from Bethesda Research 
Laboratories Inc., Bethesda, MD, USA, and RRl available from the 
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AmericaB Type Culture Collection (ATCC) of Rockville, MD; USA (No 
ATCC 31343). Preferred eukaryotic host cells include yeast and 
mammalian cells, preferably vertebrate cells such as those from a mouse, 
rat, monkey or human fibroblastic cell line. Preferred eukaryotic host 
5 cells include Chinese hamster ovary (CHO) cells available from the ATCC 
as CCL61, NIH Swiss mouse embryo cells NIH/3T3 available from the 
ATCC as CRL 1658 and monkey kidney-derived COS-1 cells available 
from the ATCC as CRL 1650 or WS0 cells. 



10 Transformation of appropriate cell hosts with a nucleic acid constructs of 
the present invention is accomplished by well known methods that 
typically depend on the type of vector used. With regard to 
transformation of procaryotic host cells, see, for example,, Cohen et al, 
Proc. Natl Acad. ScL USA, 69: 2110 (1972); and Sambrook a/, 

15 Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1989). Transformation of yeast 
cells is described in Sherman et al. Methods In Yeast Genetics, A 
Laboratory Manual, Cold Spring Harbor, NY (1986). The method of 
Beggs, Nature, 275: 104-109 (1978) is also useful. With regard to 

20 vertebrate cells, reagents useful in transfecting such cells, for example 
calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Qoning Systems, or Life Technologies Inc, 
Gaithersburg, MD 20877, USA. 

25 Successfully transformed cells, Le. cells that contain a nucleic acid 
construct of the present invention, can be identified by well known 
techniques. For txmplOy cells resulting from the introduction of an 
e;q)ression constract of the present invention can be grown to produce the 
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polypq)tide of the invention. Cells can be harvested and lysed and their 
DNA content examined for the presence of the DNA using a me&od such 
as that described by Southern, J. MoL Biol , 98: 503 (1975) or Berent et 
aU Biotech., 3: 208 (1985). Alternatively, the presence of the protein in 
5 the supernatant can be detected using antibodies as described below. 

In addition to directly assayiag for the presence of recombinant nucleic 
acid, successful transformation can be confirmed by well laiown 
immunological methods when the recombinant nucleic acid is capable of 
10 directing the expression of the protein. For example, cells successfully 
transformed with an egression vector produce proteins displaying 
appropriate antigenicity. Samples of cells suspected of being transformed 
are harvested and assayed for the proteiu using suitable antibodies. 

15 Thus, in addition to the transformed host cells themselves, the present 
invention also contemplates a culture of those cells, preferably a 
monoclonal (clonaUy homogeneous) culture, or a culture derived firom a 
monoclonal culture, in a nutrient medium. Preferably, the culture also 
contaios the protem. 

20 

Nutrient media useful for culturing transformed host cells are well known 
in the art and can be obtained from several coimnercial sources. 

A third aspect of the invention provides a vector comprising a nucleic acid 
25 according to the second aspect of the invention. 

A fourth aspect of the invention provides a host cell coinprising a vector 
according to the third aspect of the invention. 
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Preferably, the host cell is a mammalian cell. 
More preferably the host ceU is NSO or CHO. 

.5 

A fifth aspect of the inventioii provides a pharmaceutical composition 
comprising a compomid according to the first aspect of the invention and 
a pharmaceutically acceptable carrier. 

10 The compounds and compositions of the invention are administered in any 
suitable way, usually parenterally, for example intravenously, 
intraperitoneally or, preferably (for bladder cancer), intravesically 
(i.e. into the bladder), in standard sterile, non-pyrogenic formulations of 
diluents and carriers, for example isotonic saline (when administered 

15 intravenously). 

A sixth aspect of the mvenlion provides a compound accordmg to the first 
aspect of the invention for use in medicine. 

20 The compounds and compositions of the invention may be used to treat a 
patient with any disease mvolving a dysfunction of a population of cells 
ejqjressmg PEM, said compounds and compositions selectively targeting 
and destroying said population of cells within a patient. For exan5)le, said 
con5)ounds and con5)ositions may be used in the treatment of cancer, e.g. 

25 cancer of the breast, ovaries, lung, stomach, intestines, blood etc. Thus, 
anti-tumour cell antigen antibodies can be used to deliver a cjrtotoxic 
portion with endonuclease activity to a tumour cell. Antibodies that are 
internalised upon contact with the target antigen are used, such that the 
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cytotDxic portion enters the cytosol of the tumonr cell, where it can trigger 
cell death. 

In principle, the compounds and compositions of the invention may be 
5 used to treat any mammal, including pets such as dogs and cats and 
agriculturally important animals such as cows, horses, sheep and pigs. 



Preferably, the patient is human, 

10 A seventh aspect of the invention provides the use of a compound 
accordmg to &st aspect of the invention m the preparation of a 
medicament for treating a mammal having said target cells to be 
destroyed. 

15 Preferably, flie medicament is for treating cancer, such as ovarian cancer. 

A eighth aspect of the invention provides a method of treating a mammal 
^ having target cells to be destroyed, the method con^rising administering 
a compound according to the first aspect of the mvention to said mammal. 

20 

In a preferred embodiment of the seventh and eighth aspects of the 
invention, die mammal is a human. 

Preferably, flie target cells to be destroyed are cancer cells. More 
25 preferably, the cancer cells are epithelial cancer cells, such as ovarian, 
gastric, colorectal and/or pancreatic cancer cells. Most preferably, the 
cancer cells are ovarian cancer cells. 
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The invention will now be described in detail with reference to the 
following figures and exanq)les: 

Figure 1 shows the conq)lete coding sequence of human DNAse I. 

5 

Figure 2 shows (A) the mature DNAse peptide I sequence used in the 
exemplary Ab-DNase and Fab-DNase constructs, and (B) a truncated 
DNAse pq)tide I sequence encoded by a nucleotide sequence comprising 
a Kozak sequence (underlined). 

10 

Figure 3 shows (A) the nucleotide sequence encoding the humanised 
HMFGl light chain including leader peptide, (B) the nucleotide sequence 
of (A) furiher comprising a Kozak sequence (underlined), (C) the amino 
acid sequence of the humanised HMFGl light chain including leader 
IS peptide (shaded) and (D) the nucleotide sequence encoding the humanised 
HMFGl heavy chain including leader peptide, 

Figure 4 shows the linker and hinge-linker oligonucleotides used in (A) 
- the whole antibody-DNase and (B) the Fd-DNase exemplary constructs, 
20 Note, in Figure 4(A) a deletion of one or more codons between the 
HMFGl hinge and the linker is represented as aG. 

' Figure 5 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS23 comprising a leader sequence 
25 ^ (underlined) and a linker sequence (double-underlined). Figure 5(C) 
shows the nucleotide sequence of (B) further comprising a Kozak sequence 
(underlined). Figure (D) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. 
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Figure 6 shows (A), (B) and (C) shows the nucleotide sequences of Figure 
5 (A), (B) aud (C), respectively, further comprising an SV40 NLS (double 
underlined) (pAS27). Figure (D) shows the amino acid sequence of a 
5 humanised HMFG-1 Fd/DNase I fusion comprising an SV40 NLS (double 
underlined). 

Figure 7 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
10 pAS34 (as used in *Ab-DNase* m Example 2), comprising a leader 
sequence (underlined) and a linker sequence (double-underlined). 

Figure 8 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
15 pAS35, con?)rising a leader sequence (underlmed) and a Imker sequence 
(double-underlined). The lower case 'g' represents a silent mutation 
caused by PGR aniplification. 

Figure 9 shows (A) the nucleotide sequence and (B) the translated ammo 
20 acid sequence of an exemplary HMFG-1 heavy chaio/DNase I fusion 
pAS36, comprising a leader sequence (underlmed) and a Imker sequence 
(double-underlmed). The lower case 'c' represents a silent mutation 
caused by PGR an5)lification. 

25 Figure 10 shows (A) the nucleotide sequence and (B) the translated ammo 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
pAS37, comprising a leader sequence (underlined), a linker sequence 
(double-underlined) and an NLS sequence (triple underlined). 
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Figure 11 shows (A) the micleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fusion 
pAS38, con5)rising a leader sequence (underlined), a linker sequence 
5 (double-underlined) and an NLS sequence (triple underlined). The lower 
case 'g* represents a silent mutation caused by PGR amplification. 

Figure 12 shows (A) the nucleotide sequence and (B) the translated amino 
acid sequence of an exemplary HMFG-1 heavy chain/DNase I fiision 
10 pAS39, comprising a leader sequence (underlined), a linker sequence 
(double-underluied) and an NLS sequence (triple underlined). The lower 
case *c' represents a silent mutation caused by PGR amplification. 

Figure 13 shows nucleotide sequences (A and B) encoding a humanised 
15 HMFG-1 Fd/DNase I ftision pASlOl comprising a short leader sequence 
(underlined) and a linker sequence (double-underlined). Figure 13(G) 
shows the nucleotide sequence of (B) further comprising a Kozak sequence 

i 

(underlined). Figure (D) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. 

20 

Figure 14 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS102 comprisiug a leader sequence 
(underlined) and a hybrid hinge + linker sequence (double-underlined). 

Figure 14(C) shows the nucleotide sequence of (B) further comprising a 
25 Kozak sequence (underlined) (construct designated pAS302 in Example 2). 

Figure (D) shows the amino acid sequence of a humanised HMFG-1 
Fd/DNase I fusion. 



I 
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Figure 15 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pASlOS comprisiug a leader sequence 
(underlined) and a hybrid hinge + short linker sequence (double- 
underliaed). Figure 15(C) shows the nucleotide sequence of (B) further 
5 comprising a Kozak sequence (underlined). Figure (D) shows the amino 
acid sequence of a humanised HMFG-1 Fd/DNase I fusion. 

Figure 16 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS104 comprising a leader sequence 
10 (underHned) and a hybrid hinge + mutated short linker sequence (double- 
underlined). Figure (C) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. Mutations (compared to pAS103) at 
positions 775 and 924 are shaded. 

15 Figure 17 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS105 comprising a leader sequence 
(underlined), a short linker sequence (double-underlined) and an NLS 
j sequence (triple underimed). Figure 17(C) shows the nucleotide sequence 
of (B) further comprismg a Kozak sequence (underlined). Figure (D) 

20 shows the amino acid sequence of a humanised HMFG-1 Fd/DNase I 
fusion. 

Figure 18 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS106 comprising a leader sequence 
25 (underlined), a hybrid hinge + linker sequence (double-underlined) and 
an NLS sequence (triple underlined). Figure 18(C) shows flie nucleotide 
sequence of (B) further comprising a Kozak sequence (underlined). 
Figure (D) shows the ammo acid sequence of a humanised HMFG-1 
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Fd/DNase I fusion. 

Figure 19 shows nucleotide sequences (A and B) encoding a humanised 
HMFG-1 Fd/DNase I fusion pAS107 comprising a leader sequence 
5 (underlined), a hybrid hinge + short linker sequence (double-imderlined) 
and an NLS sequence (triple imderlined). Figure 19(C) shows the 
nucleotide sequence of (B) further comprising a Kozak sequence 
(underlined). Figure (D) shows the amino acid sequence of a humanised 
HMFG-1 Fd/DNase I fusion. 

10 

Figure 20 shows a schematic diagram of the pEE6 e5q)ression vector used 
in the exemplary constructs. 

Figure 21 shows autoradiographs from immuno-prec5)itation experiments 
15 with metabolically labelled transient transfectants: 

GELA 

Lane 1 shows the precipitation of supernatant from mock-transfected 
cells, 

20 Lane 2 is from cells transfected with hHMFG-1 (construct 6) giving 

e^spected molecular weig^hts of about 51.2 and 26.4 kDa for the 
heavy and light chains, respectively. 

Lane 3 shows construct 34 antibody construct which has human 
DNase I fused to the C-terminus of the heavy chain gene. As 
25 expected, the size of the heavy chain gene has increased to about 

80.7 kDa, 

Sanoples from whole antibody DNase I constructs 35, 36 and 39 
were run on the gel (Lanes 4 to 6) but were not sufficiently well 
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expressed to be visible, in this experiment. 
In subsequent e3q)eriments using this method, construct 39 was 
detectable but weak, and constructs 35 and 36 were detectable but 
very weak. Constructs 37 and 38 have not been tested in this assay 

5 system. 

Lanes 8 to 10 are fusion of humanised HMFGl F(ab*)2 witli hmnan 
DNase I (constructs 41, 23 and 102, respectively). F(ab')2 alone was 
included in this set of experiments (lane 7, construct 41) but did not 
e3q)ress, this was included in later e:q)eriments (see gels C and D). 

10 In addition to the light chain (about 26.4 kDa) and the Fd-DNAse 

I fusion (about 56.6 kDa), a third major band is observed at around 
40 kDa. Interestingly, this band is observed in the humanised 
HMFG-1 fusions but not in the antibody alone. Since an anti-F(ab02 
antibody was used for immuno-precq)itation, it is unlikely that this 

15 can be proteolysis between nnmunoglobulin and DNase I sequence. 

It probably rq)resents a population of polypeptide produced by 
premature transcriptional termination (due to DNase I sequence in 
the 3*-end of the fasion mRNA). 

20 GELB 

This is Ihe non-reducing gel counterpart to gel A, described above. 
I^e 1 is the mock-transfected control cells and lanes 2 and 3 are 
from flie cells transfected wifli humanised HMFGl alone (construct 
6) and the humanised HMFG-1 fused at flie C-^terminus to human 

25 DNase I, respectively. As before, lanes 4 to 6 are from cell 

supematants from cells transfected with constructs 35, 36 and 39. 
The gd shows that both the whole antibody and the antibody-DNase 
I fusion are assembled, with the DNase fusion giving a higher 



wo 01/74905 



PCT/GBOl/01324 



34 

molecular weight coinpared to the antibody alone. 

Figure 22 shows a typical standard curve used to determine the 
concentration of PDTRP-binding material in the supematants of transiently 
5 transfected L761h cells. Each point on the curve has been determined 
twice, 

Figure 23 shows typical standard curves used to determine the 
concentration of bovine DNAse L 

10 

Figure 24 shows corrected DNase I activity in transiently e^qpressed 
humanised HMFGl whole antibody-hnman DNAse I fusions (i.e. pAS34, 
pAS34, pAS35 and pAS6[controll). 

15 Figure 25 shows the corrected DNAse I activity in transiently expressed 
humanised HMFGl F(ab')2-human DNase I fusions {Le. pASlOl, 
I pAS102, pAS103 and pAS41 [control]). 

Figure 26 shows results of the cytotoxicity assay. 

20 

Figure 27 shows the % of MCF7 ceUs killed after mcubation with the 
exemplary constracts. 

Figure 28 shows a schematic diagram of (A) Ab-DNase and (B) Fab- 
25 DNase. 

Figure 29 diows a schematic diagram of vector pAS34K encoding Ab- 
DNase {i.e. pAS34 as shown in Figure 7b plus Kozak sequence). 
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Figure 30 shows a schematic diagram of vector pAS302 encoding Fab- 
DNase. 

5 Figure 31 shows (A) the elution profile fi:om Protein-L colimm and (B) 
size exclusion chiomatogram for Fab-DNase. 

Figure 32 shows (A) the elution profile firom Protein-A cohiron and (B) 
size exclusion chromatogram for Ab-DNase. 

10 

Figure 33 shows the SDS-PAGE stained gels for (A) Ab-DNase and (B) 
Fab-DNase. 

Figure 34 shows (A) standard curve for bovine DNase concentration AND 
15 ^) DNase activity measurements at 3 hours and 6 hours. 

Figure 35 shows (A) PEM e^qpression on OVCAR 3 and A375 cells, as 
■ measured by EUSA using hHMFG-1 and AD-DNase antibodies, and (B) 
cytotoxicity measurements. 



20 
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EXAMPLES 

Example 1 

5 (A) Mammalian expression of humanised HMFGl-DNase constnicts 

The hraoan HMFGl light and heavy chain (with or without engineering 
a fusion to human DNase I), were cloned mto the pEE6 expression vector 
system for expression in mammalian CHO or myeloid NSO cells (see 

10 figure 20). The vector system was originally developed by Celltech Ltd 
(UK) arid is now owned by al-Lonza (see Young & Owens, 1994, /. 
IrnrnunoL Meth. 168:149-165). The vector consists of two human 
cytomegalovirus promoters (hCMV) for both the heavy and light chain 
genes. Each transcription unit is completed by Ifae poly-adenylation signal 

15 (pA) with an optional immunoglobulin terminator sequence (Ig term.) 
located between the heavy and li^t chain transcription units. Propagation 
in E.coli can be selected for by the presence on an ampiciQin resistance 
gene (not shown in Fig 20) . The inclusion of a glutamine synthetase gene 
(GS) in the vector allows the stable NSO transfectomas to be selected by 

20 growth in glutamme free media, since NSO cells are GS" and cannot 
otherwise grow in glutamine jfree media. 

Exemplary humanized HMFGl-DNAse I fusion constructs of the 
invention are detailed in figures 5 to 19. 

25 

(B) Immuno-precipitation of metabolically labelled transient transfectants 
CHO-L761h cells (Cockett et al, 1990, Nuc. Acids Res. 19:319-325) 
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were transfected, according to the modificatiorL of Gorman et al, 1985), 
with expression vectors containiag either whole HMFGl antibody or 
F(ab')2 ftagment of the antibody along with the various fusion constructs 
of their respective heavy chains and human DNase I. The cells were then 
5 incubated with either 50 fiCi ^^S methione for 72 h in methionine-free 
medium. Secreted product was precipitated with a rabbit anti-human 
F(ab02 antibody bound to protein A Sepharose. Bound material was 
eluted in either reducing or non-reducing SDS-PAGE loading buffer and 
run on gels. The autioradiographs (see Figure 21) above were generated 
10 from those gels after drying them. 

(O Estimation of the efficiency of DNase constructs in supematants 
Introduction 

15 

This set of experiments was designed to standardise the amount of 
construct in a given DNase I activity assay and to allow us to commeoot on 
the amount of activity a particular construct possesses. Given that the 
antibody-DNase I fusions are so different to the F(ab')2-I>Nase I fusions 
20 it is best not to compare the two groups. Once we have purified the 
protein, we will have a better idea of the exact molecular configuration of 
all species. Then, and only then, will it be sensible to compare amongst 
groups. 

25 Determination of concentration of constructs 

Hie concentration of constructs in supematants from transiently 
transfected L761H cells was determined m a PDTRP-binding ELISA. To 
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each well of a Maxisorb 96-well ELISA plate (Nunc) was added 100 ;tl of 
carbonate buffer containing 100 ng of recombinant GST-(PDTRP)7 fusion 
protein (Gendler et ah, 1990, J. Mol Biol 265:15286-93). After 
overnight binding at 4'*C, the plate was washed three times in PBS-Tween 
5 {Le. PBS containing 0.05% Tween-20). The plate was then blocked with 
three 3-minute washes of PBS-Tween containing 1 % BSA. 

For each constract, 100 jA of supernatant was added to a weU on the plate. 
In addition, hHMFG-1 of known concentration was serially diluted down 

10 the plate using doubling dilutions in 100 lA of PBS-Tween per well. The 
plate was incubated for a further 1 h at SC'C, then 200 ng of MC135 anti- 
human kappa light chain antibody (binding site) in 100 pA of PBS-Tween 
was added to each well for 1 h at 30''C. After three 3-minute washes in 
PBS-Tween, 100 /il of anti-mouse IgG-peroxidase conjugate (Jackson 315- 

15 035-045), diluted 1:2000 ia PBS-Tween, was added to each well and 
incubated for 1 h at 30°C. Following a final set of fliree 3-minute washes 
ui PBS-Tween, 100 /xl of TMB substrate (Sigma) was added to each well 
of the plate and, after a colour developed, the optical density at 630 nm 
of the solution m each well of the plate was determined 

20 

Results 

(see Figure 22) , 

25 (D) Corrected bovine DNase I standard curves and DNase assay 

DNase activity was determined usmg a modification of the methyl green- 
DNA complex degradation method (Sinicropi et al., 1994, Analyt. 
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Biochem. 222:351-358). Briefly, a 1:1 solution of the assay buffer and 
methyl green-salmoii sperm DNA complex was mixed together to give a 
total volume of 0.2 ml. To this, 0. 1 ml of tissue culture siq)erQataiit from 
transiently transfected CHO-L7611i cells was added and the mixture 
5 incubated at ST'C. DNA cleavage by DNase results in a reduction in 
absorbance at 620 ran. Figure 23 shows a standard curve produced with 
various concentrations of bovine DNase I over a number a time point. 



Figures 24 and 25 show DNAse activity for the whole HMFGl antibody- 
10 and F(ab02 - DNase fusions, respectively. 

(E) Cytotoxicity of DNAse constructs 

Method 

15 

DNase constructs were transfected into CHO L7611i cells using a calcium 
phosphate c(>-prec5)itation method (Gorman et al. , 1985, In: DNA cloning 
^ (2nd edition), Glover A(ed.), Academic Press, NY, 163-188). Included 
in the experiment were negative controls, consisting of cells transfected 
20 with TE buffer alone or widi TE buffer and pEE6 expression vector. In 
addition to these controls, vectors that express hHMFG-1 (pAS6) and 
F(ab')2 of hHMFGl (both with specificity for PEM but without DNase I) 
were included. 



25 The supernatant from these cells was harvested after 72 h of expression, 
followed by centrifugation to remove dead cells. MCF-7 cells were 
incubated for 1 h at 37°C with an aliquot of each of these supematants. 
The amount of cellular lactate dehydrogenase (LDH) released from the 
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MCF-7 cells due to the C3^toxicity of the supernatant was determined 
using the CytoTox96 cytotoxic assay kit (Promega). Total lysis ('total 
LDH') was determined by measuring the target cell maximum LDH 
release using the kits lysis solution. The percentage of cells killed was 
5 then calculated as the proportion of the LDH released to the total LDH 
released. For each construct, the cytotoxicity assay was performed in 
quadruplicate, except for assay of pAS38 and 39, which were performed 
in triplicate. The values of LDH release for each construct were 
compared against either F(ab')2 or whole antibody, or each other, using 
10 a one-tailed t-test in Excel. 

Results 

Figures 26 and 27 shows that there is negligible cell killing with either 
15 pAS6 (HMFGl alone) or with pAS41 (F(ab')2 alone). All of the 
hHMFGl F(ab')2-DNase I constructs kill significandy more cells ttian the 
I F(ab02 fragment alone (p< 0.00193) and all of the antibody-DNase I 
constructs kill significantly more cells than antibody alone (p < 0.00783), 
except for perhaps pAS34 (p < 0.021). 

20 

(F) Use of the DNase-I/huHMFG-1 Fab fusion protein in the treatment of 
ovarian cancer 

Patients diagnosed with ovarian cancer are treated by mtravenous injection 
25 of the DNaseI/huHMFG-1 Fab fusion protem. Typically, a dose of 
between 1 to 100 mg will be administered weekly. 

Therapeutic response is measured by the normal clmical procedures that 
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are well known in the art, for example radio-imaging methods. 
Example 2 

5 (A) MamrnaliaTi expression of hmnamsed HMFG-1 / DNase constmcts 

In a second series of e:q)erimeDts, two fimher hmnanised HMFG-l/Dnase 
constructs were expressed in mammalian cells. The first constract 
encoded a fusion protein a con5)lete hHMFG-1 antibody fused wifli human 
10 DNase, designated 'Ad-DNase'. The second construct encoded a fusion 
protein a Fab fiagment of the hHMFG-1 antibody fused with human 
DNase, designated 'Fab-DNase'. Ad-Dnase and Fab-DNase are shown 
schematically in Figure 28. 

15 Ad-DNase con5)rises an HMFG-1 light cham as shown in Figure 3(c) and 
an HMFG-1 heavy chain/DNase fusion as shown in Figure 7(b). 

5 Fab-DNase comprises an HMFG-1 light diain as shown in Figure 3(c) and 
an HMFG-1 Fd chain/DNase fusion as shown in Figure 14(d). 

20 

The human HMFGl heavy and light chain constructs were cloned into the 
pEE6 e3q)ression vector system for e35)ression in mammalian CHO or 
myeloid NSOcdls, as described in Section (A) of Eran^lel. This vector 
consists of two human cytomegalovirus promoters (hCMV) for both the 
25 heavy and light chain genes. Each transcription unit is completed by the 
poly-adenylation signal (pA) with an optional immunoglobulin terminator 
sequence (Ig term.) located between the heavy and light chain 
transcription units. The vectors also con?)rise a 5'-UT Kozak sequence 
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(to enhance translation of the mRNA) and an ATG initiator codon 
upstream of both heavy and light chains. 

The vectors encoding Ad-Dnase and Fab-DNase, designated pAS34K and 
5 pAS 302 respectively, are shown schematically in Figure 32. 

Propagation in E.coli can be selected for by the presence on an ampicillin 
resistance gene. The inclusion of a glutamine synthetase gene (GS) ia the 
vector allows the stable NSO transfectomas to be selected by growth in 
10 glutamine free media, since NSO cells are GS" and cannot otherwise grow 
m glutamine free media. 

These plasmids were co-transfected with a vector containing a neomycin 
resistance gene into CHO cells- Stable cell lines were generated for each 
15 of the constructs. 

Clones were selected that expressed DNase activity and antigen (PEM)- 
bmding activity. 

20 (B) Purification of hHMFG-l/DNase constructs 

The cells were routinely grown in: 



DMEM (Gibco 10938-025) 500 ml 

25 Non essential amino acids (Sigma M7145) 5 ml 

Sodium pyruvate (Sigma S8636) 5 ml 

Glutamme (G7513) 5 ml 

Heat inactivated foetal calf serum 50 ml 



wo 01/74905 



PCT/GBOl/01324 



43 

Incubation was carried out at 37 in 5 % CO2. 

For prodoction of the Ab-DNase fusion protein, W70 cells (CHO cells 
5 transfected with pAS34K) were maintained in flats and grown to 
confluency in T175 flasks. Each T175 flask was spUt between two 
850 cm^ roller bottles containing 100 ml of flie aforementioned growth 
media. Each roller bottle was gassed with an 95 % air 5 % CO2 mix for 1 
mhrate and flien sealed. They were rolled at a rate of 0.5 rpm and were 
10 gassed every other day as described earlier until the cultures were 
confluent. At this stage the medinm was removed and 200 ml of harvest 
medium was replaced on the culture. This was the same medium but 
contained 2 mM sodium butyrate (with or without 10% heat mactivated 
FCS). The cells were then grown for a further 3-4 days before they were 
15 harvested. The medium was coDected from the cells and dead cells were 
removed ftom the medium by centrifogation at 5000 rpm for 30 mins at 
40c. The spun medium (supernatant) was then filtered through a 0.2 
micron filter unit, prior to applying to the afBnity chromatography 
column. 

20 

The Fab-DNase fiision product was then purified by affinity 
chromatogr^hy uang a Protein-L column (Protein L agarose, P3351 from 
Sigma Co, Poole, Dorset, XJK), as follows: 

25 1. Wash 1 ml of settled protein L agarose (P3351) wifli at least 5 
volumes of phosphate buffered saline (PBS: 10 mM phosphate 
buffered salme, pH 7.4). 
2. Dflute 1 ml supernatant with 9 ml PBS. 
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3. Mix diluted supernatant with protein-L agarose and incubate with 
gentle end over end mixing for 1 hour at room temperature. 

4. Pack the slurry in a column and drain. 

5 . Wash away unbound proteios with 10-15 column volumes of PBS . 
5 6. Elute bound protein withSml elution buffer (0.1 M glycine, 

pH 2.0, or 0.2 M citrate buffer, pH2.8). 
7, Neutralise eluted material with Tris-base to achieve pH 7.5. 

Figure 31(a) shows the elution profile of the Fab-DNase from flie Protein- 
10 L column when eluted wifli 0. 1 M glycine, pH 2.0. 

Following purification, Fab-DNase was analysed by analytical size- 
exclusion chromatography on a Superdex-200 column. 

15 Figure 31(b) shows the size-exclusion chromatogram obtained for the Fab- 
DNase. 

^ The Ab-DNase fusion product was purified by affinity chromatography 
usmg a Protein-A sepharose column, as follows: 

20 

1 . 25 ml of protein A sepharose fast flow resin (Amersham Pharmacia 
Biotech) in an XK26 column (Amersham Pharmacia Biotech) was 
equihT>rated in O.IM glycine, pH8.8, 0.5M NaCI. 

2. Approximately 2 litres of sterile-filtered supernatant from cell line 
25 W70 (CHO cell line making 34K) was passed the column overnight 

, at a low flow rate (1-2 ml/min). 

3. The column was then washed down to base-line and was re- 
equilibrated in 0.15M disodium hydrogen phosphate, pH9.0 and the 
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bound 34K was eluted by running a gradient between this buffer 
(A) and a low pH buffer (B) which consisted of O.IM citric acid, 
pH2.0. siq)plemented to 2 mM calcium chloride and 2 mM 
magnesium sulphate. The gradient was run over 100 ml at a flow 
5 rate of 4 ml/min and a further 50 ml of buffer B was run over the 

column at the completion of the gradient, also at 4 ml/min. 

4. During the 100 ml gradient and the last 50 ml of buffer A fractions 
were collected. The peak fractions were identified and pooled and 
dialysed against 4 litres of 25 mM Hepes, pH7.5, 0.2 M NaCl, 

10 ImM calcium chloride and ImM magnesium sulphate. Dialysis 

was performed overnight at 4C. 

5. The dialysate was concentrated on Centricon spin concentrators to 
a final concentration of 6-13 mg/ml. The concentration was 
determined by dividing by its extinction coefficient of 1.558 

15 (calculated from the known sequence). 

Figure 32(a) shows the ehition profile of the Ab-DNase from the Protein- 

i 

L column when eluted with a gradient of 0,15 M Na2HP04, pH 9.0 to 
0.1 M citric acid, pH 2.0 contaming 2mM each of CaCl^ and MgCli- 

20 

Figure 32(b) shows the size-exdusioa chromatogram obtained for the Ab- 
DNase. 

(C) Determination of concentration of fusion protems 

25 

Prior to measuring DNase activity of the purified fusion proteins (see 
Section (E) below), the concentration of the protems was determined by 
EUSA, as fottows (see also Section (Q of Example 1). 
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Materials 

1. 96 WeU ELKA plates (Nunc F96 Maxisorp Cat No. 442404). 

2. Bovine serum albumin (Sigma A-9647). 

3. Coating buffer (Na2C03 1.59 g/1, NaHCOS 2.93 g/1, NaN3 0.2 
g/1, pH9.6. 

4. GST-MUC1-7TR antigen (1.5 mg/ml). 

5. Anti-hmnan kappa light-chain antibody GD12 (0:2 mg/ml, 
Binding Site, MC135). 

6. Peioxidase-conjugated rabbit anti-mouse IgG (Jackson, 315-035- 
045). 

7. TMB- substrate buffer (Sigma P-4417). 

8. Tween 20 (Sigma P7949). 

9. Purified humanised HMFGl (1 .4 mg/ml). 

Method 

Note all washes in this protocol consist of 3 x 3 min washes in PBS buffer 
20 (note: all PBS buffer contained 0.05 % Tween) and the plate was 
iucnbated in a lunch box containing moist tissue paper. 

1. Coat 100 ng of andgen/100 fd coating buffer/well overnight at 
4*»C. 

25 2. Wash the plate and block each well with 100 fi\ of PBS 

containing 0.05 % Tween, and 1% BSA for 1 h at 30oC. Wash 
plate afterwards. 

3. A standard curve of humanised HMFGl should be prepared 



10 
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down the plate using doubling dilutions. Make each dilution in 
100 111 PBS buffer and for the highest concentration in the 
curve use 1000 ng of antibody. 

4. Incubate the plate for 2 h at 30°C, wash, and add 100 jul PBS 
5 containing 200 ng of the anti-human Kappa light chain antibody 

to each well of the plate. Incubate for a farther 1 h at 30°C 
and then wash the plate. 

5. Add 100 lA PBS contaming flie rabbit anti-mouse IgG- 
peroxidase conjugate (diluted 1:2000) to each well of the plate 

10 and mcubate for 30 mm at 30°C. Washlhe plate and add 100 

fd TMB- substrate^buffer to each well of die plate and allow 
the reaction to proceed in the dark at room temperature. When 
the blue colour has developed, read the plate at a wavelength 
of 630 nm. 



15 



(D) SDS-PAGE 



^ Followmg purification of Ab-DNase and Fab-DNase, the fusion proteins 
were analysed by SDS-PAGE under non-reduciag and reducing 
20 conditions, as described in Section (B) of Example 1 . 



In brief, afBnity-purified material was used. In the case of the Ab-DNase 
fusion protein, this was from a sample dialysed and concentrated (as 
described m 4e protein A protocol above). In the case of the Fab-DNase, 
25 this was unconcentrated protein directiy eluted from the protem L afBnity 
column. 15 ul of the Fab-DNase protein-L eluate was mixed with 5 ul of 
either reducing or non-reducing loadmg buffer whereas 2 ul of the Ab- 
DNase protein A eluate (dialysed and concentratec^) was mixed with 5 ul 
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of either reducing or non-reducing buffer. Both samples were boiled for 
5 minutes and were loaded onto the gel. The gels were stained wifli 
Coomassie Brilliant Blue stain. The cells were not labelled with 35S- 
methionine (as in Example 1). 

5 

The SDS-PAGE autoradiograph for Ab-DNase is shown in Figure 33(a). 
Under reducing conditions, Ab-DNase produces a band of about 80 kDa, 
which corresponds to the expected size of the heavy chain-DNase fusion 
product (see lane 3), A further band of about 50 kDa is also observed, 
10 which is approximately Ihe same molecular weight as the hHMFG-1 heavy 
chain (see lane 4). 

The SDS-PAGE autoradiograph for Fab-DNase is shown in Figure 33(b). 
Under reducing conditions, Fab-DNase produces a band of about 55- 

15 60 kDa, which corresponds to the expected size of Fab-DNase (see 
lane 3). Under non-reducing conditions, a band of about 80-85 kDa is 
observed, which is the q)proximate molecular weight of Fab-DNase rather 
than F(ab')2-DNase (see lane 4). Thus, the Fab-DNase appears to exist 
as a dimer of the hHMFG-1 light chains and the hHMFG-1 heavy 

20 cham/human DNase fusion, not a tetrameric F(ab')2-I>Nase. 

(E) Measurement of DNase activity of hHMFG-l/DNase constructs 

DNase activity of the two fusion proteins was detenniued as described in 
25 Section (D) of Example 1. In brief, 0.1 ml of the purified protein was 
added to a 1:1 solution of assay buffer and methyl green-salmon sperm 
DNA complex, and the mixture mcubated at 37*^C. A reduction in 
absorbance at 620 nm is indicative of DNA activity. 



wo 01/74905 



PCT/GBOl/01324 



49 

A standard curve produced using bovine DNase I is shown in 
Figure 34(a). 

5 Figure 34(b) shows the DNase activity of flie Fab-DNase and Ab-DNase 
fusion proteins 3 h and 6 h after being added to the DNA, can5)ared to a 
positive control of bovine DNase and a negative control of Fab only. 
Clearly, the DNase activity of the Fab-DNase and Ab-DNase fusion 
proteins is comparable to that of tiie bovine DNase positive control. 

10 

(F) CMotoxicitv of DNase activity of hHMFG-l/DNase constructs 

Cytotoxocity of the Fab-DNase and Ab-DNase fiision protems was 
analysed using two tumour cell lines, the human malignant melanoma cell 
15 line A375 and the hunoan ovarian adenocarcinoma cell line OVCAR 3. 

An initial cell-based ELISA was performed using hBDMFG-1 antibodies to 
■ determine ihe level of ejcpression of PEM (the MUCl gene product) on 
these cells. 

20 

Cell-based PEM ELISA assay protocol 
Materials and methods 

25 1. Phosphate buffered saline tablets (Sigma P-4417) 

2. 50 % glutaraldehyde solution (BDH UN2810 Prod. 2868240) 

3. sodium azide (Sigma S-8032) 

4. Nundon 96 well tissue culture plate (Nunc D167008) 
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5. BSA (Sigma A.9647) 

6. OVCAR-3 ovarian cancer cells, A375 melanoma cancer cells 
both from ATCC 

7. TMB substrate buffer (Sigma P-4417) 
5 8. Tween 20 (Sigma P7949) 

9. Purified humanised HMFGl (1 mg/ml from ICRF) 

10. RPMI 1640 media (Gibco 21875-034) 



Protocol 



10 



1. The OVCAR-3 and A375 cells were grown in RPMI containing 
20% and 10% PCS respectively at 37^C in 5% C02 in a 96 well 
tissue culture plate, seeded at 106 cells/ml with 0.1 ml/well. 

2. Excess media was removed and the plate was fixed with 0.05 % 
15 glutaraldehyde in water for 1 hour at room temperature. 

3. Excess ^utaraldehyde/water solution was removed and the plates 
were washed three times with PBS containing 0.05% Tween 20. 
The plate was stored at 4°C until required in PBS with 0.02% 
sodium azide), 

20 4. To use the plate, the plate was then washed with three washes of 

PBS containing 0.05% Tween 20, and the wells were blocked 
with 0.1 ml 5% BSA in PBS containmg 0.05% Tween 20. The 
wells were blocked for 1 hour at 30°C. 
5. They washed three times as described before. Serial dilutions of 

25 hHMFGl were plated out on the wells from a TnayirmTTTi 

concentration of 2 ^g/ml downward. Dilutions of constructs 
were also sunilarly plated onto the fixed cells. All dilutions 
were prepared in PBS containing 0.05 % Tween 20. 
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6. The proteins were iacubated with the fixed cells for 1 hour at 
30°C and were again washed three times as described above. 

7. And-htnnan IgG-Fc peroxidase conjugate andbody (Jackson 209- 
035-103) was dihited to 1:2000 in PBS contammg 0.05% Tween 
20. This was incubated at SO^C for 30 minutes. 

8. Once again the cells were washed as described as before. Then 
0. 1 ml TMB substrate was put in each well and the colour was 
developed at room temperature and the absorbance at 655 nm 
was determined. 

For conq)arison, an additional ELISA using Ab-DNase was performed 
with the OVCAR 3 cells. 

Antigen-bound hHMFG-1 and Ab-DNase was detected by a peroxidase- 
15 conjugated anti-human Fc antibody. 

The results of the ELISA are shown in Figure 35, mdicating that the 
OVCAR 3 cell line expresses high levels of PEM (as measured by both 
hHMFG-1 and Ab-DNase) while flie A375 cell Une expresses low levels 
20 of ¥EM (and hence can be used as a negative control in cytotoxicity 
experiments). 

Cytotoxicity was measured using an LDH release assay, as described in 
Section (E) of Exanqile 1. In brief, 10' cells per well of the A375 and 
25 OVCAR 3 cell lines were plated in a 96-well plate and grown for 24 
hours. Fifteen microlitres of the purified fusion proteins (containing 
200 ng of Ab-DNase or 100 ng of Fab-DNase) were added to the cells and 
incubated for 48 hours at 37*>C. A negative control group of each cell 



wo 01/74905 



PCT/GBOl/01324 



52 

type was treated with 200 ng of the hHMFG-1 antibody (i.e. not fused to 
DNase). 

Following the incubation period, 50 fil of the supernatant was removed 
5 and incubated with 50 ^1 of tetrazolium-containing substrate buffer for 30 
minutes at 22®C. The reaction was stopped with stop buffer (Promega) 
and the absorbance of the reaction mixture at 490 nm measured. 

Both Fab-DNase and Ab-DNase fusions show cell killing of OVCAR 3 
10 cells as compared to the negative control hHMFG-1 treated cells. In 
contrast, killing of A375 cells by DNase fusions is negligible, consistent 
with negligible binding of the fusions to these cells. 



wo 01/74905 



PCT/GBOl/01324 



53 

CLAIMS 

1 . A conipoiuid comprising a target cell-specific portion and a cytotoxic 
portion characterised in that: 

(i) the target cell-specijBc portion comprises an humanised monoclonal 
antibody having specificity for polymorphic q>iflielial mucin (PEM), 
or an antigen binding fragment thereof; and 

(ii) the cytotoxic portion lias endonucleolytic activity. 

2. A compound according to Claim 1 wherein the target cell-specific 
portion comprises an humanised HMFG-1 antibody or an antigen 
buidiag fragment thereof. 

3. A compound according to Claim 2 wherein the target cell-specific 
portion is an humanised HMFG-1 antibody. 

4. A con?)ound according to Claim 1 or 2 wherein flie target cell-specific 
portion comprises an antigen binding fragment of the humanised 
antibody selected from the group consisting of Fab-like molecules, 
such as Fab and F(ab02, Fv molecules, disulphide-linked Fv 
molecules, ScFv molecules and single domain antibodies (dAbs). 

5. A compound according to Claim 4 wherein the target cell-specific 
portion cornprises a Fab molecule. 

6. A compound accordiag to Claun 4 wherein the target cell-specific 
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portion coirprises a F(ab')2 molecule. 

7. A con5)oimd according to Claim 1 wherein the target cell-specific 
portion comprises an amino acid sequence encoded by at least part of 
one or both of the nucleotide sequences of Figure 3(a) and (d). 

8. A compound according to Claim 7 wherein the target cell-specific 
portion comprises an amino acid sequence encoded by the nucleotide 
sequCTce of Figure 3(a) and an amino acid sequence encoded by the 
nucleotide sequence of Figure 3(d). 

9. A compound according to any one of Claims 1 to 8 wherein the 
cytotoxic portion has DNA endonucleolytic activity. 

10. A compound according to Claim 9 wherem the cytotoxic portion is at 
least the catalytically active portion of a DNA endonuclease. 

11. A compound according to Claim 10 wherein the endonuclease is a 
mammalian deoxyribonuclease 1. 

12. A compound according to Claim 11 wherein the endonuclease is a 
human deoxyribonuclease L 

13. A compound according to Claim 1 wherein the endonuclease is a 
restriction endonuclease. 

14. A compound according to Claim 10 wherein the cytotoxic portion 
con:5)rises the ammo acid sequence shown in Figure 2(a) or (b). 
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15. Aa)ii?)OuiidaccoidmgtoanyoiieofaaimsltDl4wher^^ 
localization signal is incorporated. 

16. A componnd according to Claim 15 wherein the nuclear localization 
signal comprises the sequence PKKKRKV. 

17. A compound according to any one of Claims 1 to 16 wherein the 
target cell-specific portion and die cytotoxic portion are fiised. 

18. A compound accordnig to Qaim 17 wherein the target cell-specific 
portion and the cytotoxic portion are separated by a linker sequence. 

19. A conqxrand accordmg to Claim 18 wherein the Imker sequence is or 
conq>iises GG or GSGG. 

= 20. A compound according to any one of Qauns 1 to 19 wherdn the 
compound comprises all or part of the ammo acid sequence as shown 
m Figure 3(c) together with all or part of an ammo acid sequence 
selected from flie group consisting of ammo acid sequences as shown 
in Figures 5(d), 6(d), 7(b), 8(b). 9(b), 10(b), 11(b), 12(b), 13(d), 
14(d), 15(d), 16(c), 17(d), 18(d) and 19(d). 

21 . A coi]5)ound according to Claim 20 wherem the conqwund con^aises 
an amino acid sequence as shown in Figure 3(c) and an ammo acid 
sequHice as shown in Figure 7(b). 

22. Acon5)oundaccoKiingtoClann20whereinfliecon5)Oimdc(m5jrises 
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an amino acid sequence as shown in Figure 3(c) and an amino acid 
sequence as shown in Figure 14(d). 

23. A nucleic acid molecule encoding a compound as defined in any one 
of Claims 1 to 22. 

24. A nucleic acid molecule according to Claim 23 wherein the molecule 
coniprises all or part of the nucleotide sequence as shown in 
Figure 3(a or b) together with all or part of a nucleotide sequence 
selected from the group consisting of nucleotide sequences as shown 
m Figures 5(a, b and c), 6(a, b and c), 7(a), 8(a), 9(a), 10(a), 11(a), 
12(a), 13(a, b and c), 14(a, b and c), 15(a, b and c), 16(a and b), 
17(a, b and c), 18(a, b and c) and 19(a, b and c). 

25. A nucleic acid molecule according to Clann 24 wherein the molecule 
comprises a nucleotide sequence as shown in Figure 3(b) and a 
nucleotide sequence as shown in Figure 7(a). 

25. A nucleic acid molecule according to Claim 24 wherein the molecule 
comprises a nucleotide sequence as shown in Figure 3(b) and a 
nucleotide sequence as shown in Figure 14(c). 

26. A nucleic acid molecule according to any one of Claims 23 to 25 
wherein the molecule further comprises a Kozak consensus ribosome- 
binding site. 

27. A vector comprising a nucleic acid molecule according to any one of 
Clanns 23 to 26. 
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28. A host cell comprising a vector according to Qaim 27. 

29. A pharmaceutical composition comprising a compoimd according to 
any one of Claims 1 to 22 and a phannaceutically acceptable carrier. 

30. A compound according to any one of Qaims 1 to 22 for use in 
medicine. 

31. Use of a compound according to any one of Claims 1 to 22 in the 
preparation of a medicament for treating a mammal having said target 
cells to be destroyed. 

32. A method of treating a mammal having target cells to be destroyed, the 
method comprising administering a compound according to any one of 
Claims 1 to 22 to said mammal. 

! 33. A use according to Claim 31 or a method according to Claim 32 
wherein the mammal is a human. 

34. A use according to Clahn 31 or a method according to Claim 32 
wherein the target cells to be destroyed are cancer cells. 

35. A use or a method according to Claim 34 wherein the cancer cells are 
epithelial cancer cells. 

36. A use or a method according to Claim 35 whereia the cancer cells are 
ovarian, gastric, colorectal and/or pancreatic cancer cells. 
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37- A use or a method according to Qaira 36 wherein the cancer cells are 
ovarian cancer cells. 

38. A componnd substantially as described herein, preferably with 
reference to one or more of the accompanying figures. 
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Human PNase I 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



siq peptide 



CDS 



gene 



mat peptide 



HUMDNASEI 1039 bp mRNA PRI 06-MAR-1995 

Human DHase I mRNA/ complete cds. 

M55983 

M55983.1 GI:181623 
DNase I. 

Human pancreus, cDNA to mRNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1039) 

Shak/S., Capon,D.J., Helltaiss,R. , Marsters^S.A. and Baker,C.L. 
RecoxDblnant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 91B8-9192 (1990) 
91067672 

Location/Qualifiers 
1..1039 

/organisnR^Homo sapiens" 
/db_xref ^'•taxon : 9 606" 
/ clone="hDNase-18 -1" 
/ tissue^type^^pancreas " 
160.. 225 
/gene=" DNase I" 
160,.1008 
/gene="DNase I" 
/ codon_s tart«l 
/product="DNase I" 
/protein_id="AAA63170J," 
/db xref="GI: 181624" 

/translation='TmGMKLLGALLALaALLQGAVSLKIAAFNIQTPGETKMSra 
SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHYWSEPLGRNSYKERY 
LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPG 
DAVAEIDALYDVYLDVQEKWGLEDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLI 
PDSADTTATPTHCAYDRIWAGMLLRGAWPDSALPFNFQAAYGLSDQLAQAISDHYP 
VEVMLK" 
160. .1008 
/gene=" DNase I" 
226. .1005 
/gene=" DNase I" 
/product='*DNase I" 
226 a 305 c 282 g 226 t 



BASE COUNT 
ORIGIN 

1 tcctgcacag gcagtgcctt gaagtgcttc ttcagagacc tttcttcata gactactttt 
61 ttttctttaa gcagcaaaag gagaaaattg tcatcaaagg atattccaga ttcttgacag 
121 cattctcgtc atctctgagg acatcaccat catctcagga tgaggggcat gaagctgctg 
181 ggggcgctgc tggcactggc ggccctactg . cagggggccg tgtccctgaa gatcgcagcc 
241 ttcaacatcc agacatttgg ggagaccaag atgtccaatg ccaccctcgt cagctacatt 
301 gtgcagatcc tgagccgcta tgacatcgcc ctggtccagg aggtcagaga cagccacctg 
351 actgccgtgg ggaagctgct ggacaacctc aatcaggatg caccagacac ctatcactac 
421 gtggtcagtg agccactggg acggaacagc tataaggagc gctacctgtt cgtgtacagg 
481 cctgaccagg tgtctgcggt ggacagctac tactacgatg atggctgcga gccctgcggg 
541 aacgacacct tcaaccgaga gccagccatt gtcaggttct tctcccggtt cacagaggtc 
601 agggagtttg ccattgttcc cctgcatgcg gccccggggg'acgcagtagc cgagatcgac 
661 gctctctatg acgtctacct ggatgtccaa gagaaatggg gcttggagga cgtcatgttg 
721 atgggcgact tcaatgcggg ctgcagctat gtgagaccct cccagtggtc atccatccgc 
781 ctgtggacaa gccccacctt ccagtggctg atccccgaca gcgctgacac cacagctaca 
841 cccacgcact gtgcctatga caggatcgtg gttgcaggga tgctgctccg aggcgccgtt 
901 gttcccgact cggctcttcc ctttaacttc caggctgcct atggcctgag tgaccaactg 
961 gcccaagcca tcagtgacca ctatccagtg gaggtgatgc tgaagtgagc agcccctccc 
1021 cacaccagtt gaactgcag 



Fig.l 



SUBSTITUTE SHEET (RULE 26) 
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Human DNase I construct 



LOCOS 

DEFINITION 
site 

ACCESSION 
NID 

KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



sig_peptide 



CDS 



gene 



MHDNASE.DN 783 bp mRNA PRI 06-MAR-1995 

Human DNase I mRNA, complete cds. Mature sequence modified to remove Narl 

M55983 
gl81623 
DNase I. 

Human pancreus, cDNA to mRNA* 
Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1039) 

Shak, S., Capon, D.J. , Hellmiss,R., Marsters, S.A. and Baker,C.L. 
Recombinant human DNase I reduces the viscosity of cystic fibrosis 
sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

Location/Qualifiers 
1. .1039 

/organism="Homo sapiens" 
/db_xref=''taxon: 9606" 
/clone="hDNase-18-l" 
/tis5ue_typ€= "pancreas" 
160. .225 
/gene= "DNase I" 
160. .1008 
/gene= "DNase I" 
/codon_start=l 
/product^" DNase I" 
/db_xref="PID:glB1624" 

/translation="LKIAAFNIQTFGETBayiSNATLVSYIVQILSRYDIALVQEVRDSH 
LTAVGKLLDNLNQDAPDTYHYWSEPLGRNSyKERYLFVYRPDQVSAVDSYyyDDGCE 
PCGNDTFNREPAIVRFFSRFTEVREFAIVPLHAAPGDAVAEIDALYDVYLDVQEKWGL 
EDVMLMGDFNAGCSYVRPSQWSSIRLWTSPTFQWLIPDSADTTATPTHCAYDRIWAG 
MLLRGAWPDSALPFNFQAAYGLSDQLAQAISDHYPVBVMLK" 
160. .1008 



mat_peptide 



BASE COUNT 
ORIGIN 



168 



/gene="DNase I" 
226. .1005 
/gene="DNase I" 
/product^" DNase I" 
a 236 c 220 g 



159 t 



1 CTGAAGATCG CAGCCTTCT^ CATCCAGACA 
61 CTCGTCAGCT ACATTGXGCA GATCCTGAGC 
121 AGAGACAGCC ACCTGACTGC CGTG6GGAAG 
181 GACACCTATC ACTACGTGGT CAGTGAGCCA 
241 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
301 TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
361 CGGTTCACAG AGGTCA6GGA 6TTT6CCATT 
421 GTAGCrCGAGA TCGACGCTCT CTATGACGTC 
481 GAGGAC6TCA TGTTGATGGG CGACTTCAAT 
541 TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
601 GACACCACAG CTACACCCAC GCACTGTGCC 
661 CTCCGA66GG CCGTTGTTCC CGACTCGGCT 
721 CTGAGTGACC AACiTGGCCCA AGCCATCAGT 
781 TGh 



TTTGGGGAGA CCAAGATGTC CAATGCCACC 
CGCTACGACA TCGCCCTG6T CCA6GAGGTC 
CTGCTGGACA ACCTCAATCA GGACGCACCA 
CTG6GAC6GA ACAGCTATAA GGAGCGCTAC 
GCGGTGGACA GCTACTACTA CGATGATG6C 
CGAGAGCCIAG CCATT6TCA6 GTTCTTCTCC 
GTTCCCCTGC ATGC66CCCC (^GGGGACGCA 
TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
6CGGGCTGCA 6CTATGTGAG ACCCTCCCA6 
ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
TATGACAGGA TCGTGGTTGC AGGGATGCTG 
CTTCCCTTTA ACTTCCAG6C TGCCTATGGC 
GACCACTATC CAGT6GAGGT GATi^CTGAAG 



Fig. 2(A) 
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LOCUS PAS155_GB. 858 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers- 
frag 10.. 75 

/note="160 to 225 of Untitledl" 
frag <10..>75 

/note="l to 1039 of M55983.DNA [Split]" 
source <10.->75 

/organism="HomD sapiens" 

/c3b_xref="taxon: 9606" 

/clone=="hDNase-18-l" 

/tissue_type«"pancreas [Split]" 
sig^peptide 10. . 75 

/gene="DNase I" 
CDS 10..>75 

/gene«"DNase I" 

/codon_start«"l" 

/product="DNase I" 

/db_xref«"PID:gl81624" 

/translation='WGMKLLGALIJU:iAALLQGAVS|LKIAAFNIQTFGETKM 

SYIVQILSRYDIALVQEVRDSHLTAVGKLLDNLNQDAPDTYHyWSEPLGRNSYKERY 

LFVYRPDQVSAVDSYYYDDGCEPCGNDTFNREPAIVRFFSRFTEVREFAIVPIHAAPG 

D... [Split]" 
gene 10. •>75 

/gene="DNase I [Split]" 
frag 76.. 858 

/note~"l to 783 of mod hiimanDNAsel" 
frag 76.. 858 

i /note="l to 72 of 1041inker" 

^ frag join (76. .>129,<131- .147) 

/note«"l to 72 of 1031inker [Split]" 
frag j oin ( 7 6 . . >126, <127 . . >129, <131 • . 147 ) 

/note-"l to 78 of 1021inker [Split]" 
BASE COUNT 177 A 260 C 251 G 170 T 0 OTHER 

ORIGIN 

1 GCCGCGACCA TGA6GGGCAT GAAGCTGCTG GGGGCGCTGC TGGCACTGGC GGCCCTACTG 
61 CAGGGGGCCG TGTCCCTGAA GATCGCAGCC TTCAACATCC AGACATTTGG GGAGACCAAG 
121 ATGTCCAATG CCACCCTCGT CAGCTACATT GTGCAGATCC TGAGCCGCTA CGACATCGCC 
181 CTGGTCCAGG AGGTCAGAGA CAGCCACCTG ACTGCCGTGG GGAAGCTGCT GGACAACCTC 
241 AATCAGGACG CACCAGACAC CTATCACTAC GTGGTCAGTG AGCCACTGGG ACGGAACAGC 
301 TATAAGGAGC GCTACCTGTT CGTGTACAGG CCTGACCAGG TGTCTGCGGT GGACAGCTAC 
361 TACTACGATG ATGGCTGCGA GCCCTGCGGG AACGACACCT TCAACCGAGA GCCAGCCATT 
421 GTCAGGTTCT TCTCCCGGTT CACAGAGGTC AGGGAGTTTG CCATTGTTCC CCTGCATGCG 
481 GCCCCGGGGG ACGCAGTAGC CGAGATCGAC GCTCTCTATG ACGTCTACCT GGATGTCCAA 
541 GAGAAATGGG GCTTGGAGGA CGTCATGTTG ATGGGCGACT TCAATGCGGG CTGCAGCTAT 
601 GTGAGACCCT CCCAGTGGTC ATCCATCCGC CTGTGGACAA GCCCCACCTT CCAGTGGCTG 
661 ATCCCCGACA GCGCTGACAC CACAGCTACA CCCACGCACT GTGCCTATGA CAGGATCGTG 
721 GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT GTTCCCGACT CGGCTCTTCC CTTTAACTTC 
781 CAGGCTGCCT ATGGCCTGAG TGACCAACTG GCCCAAGCCA TCAGTGACCA CTATCCAGTG 
841 GAGGTGATGC TGAAGTGA 

// 



Fig. 2(B) 
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LOCUS 

DEFINITION 

ACCESSION 

KEYWORDS 

SOURCE 

ORGANISM 
REFERENCE 

AUTHORS 

TITLE 

JOURNAL 
COMMENT 
FEATURES 

SITES 



pAS6 - light chain 

HMFG1LC2.D 721 bp DNA 

HUMANISED HMFGl LIGHT CHAIN Vnp LEADER. 



18-ADG-1998 



1 (BASES 1 TO 342) 
VERHOEYEN ET AL 
CONSTRUCTION OF RESHAPED HMFGl 
IMMUNOL. (1993) :78, 364-370 
SCANNED IN FROM JOURNAL 



ETC 



This is the sequence of the HMFGl light chain gene with the 
Vnp leader sequence attached. Translate from 
residue 1. Note residue 399 is T > A in all clones leading 
to R133 silent mutation (T in Verhoeyen paper) 

BASE COUNT 197 a 202 c 182 g 140 t 

ORIGIN ? 

X LKPiPKR REQ 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCcIgAC 

61 ATCCAGATGA CCCAGAGCCC AA6CAGCCTG AGCGCCAGCG TGGGTGACAG AGTGACCATC 
121 ACCTGTAAGT CCAGTCAGAG CCTTTTATAT AGTAGCAATC AAAAGATCTA CTTGGCCTGG 
181 TACCAGCAGA AGCCAGGTAA GGCTCCA7UV6 CTGCTGATCT ACTGGGCATC CACTAGGGAA 
241 TCTGGTGTGC CAAGCAGATT CAGCGGTAGC GGTAGCGGTA CCGACTTCAC CTTCACCATC 
301 AGCAGCCTCC AGCCAGAGGA CATCGCCACC TACTACTGCC AGCAATATTA TAGATATCCT 
361 CGGACGTTCG GCCAAGGGAC CAAGGTGGAA ATCAAACGAA CTGTGGCTGC ACCATCTGTC 
421 TTCATCTTCC CGCCATCTGA TGAGCAGTTG AAATCTGGAA CTGCCTCTGT TGTGTGCCTG 
481 CTGAATAACT TCTATCCCAG AGAGGCCAAA GTACAGTGGA AGGTGGATAA CGCCCTCCAA 
541 TCGGGTAACT CCCAGGAGAG TGTCACAGAG CAGGACAGCA AGGACAGCAC CTACAGCCTC 
601 AGCAGCACCC TGACGCTGAG CAAAGCAGAC TACGAGAAAC ACAAAGTCTA CGCCTGCGAA 
661 GTCACCCATC AGGGCCTGAG CTCGCCCGTC ACAAAGAGCT TCAACAGGGG AGAGTGTTAG 
721 A 

// 



Fig. 3(A) 
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LOCUS 



HHMFGIKLC 



730 BP SS-DNA 



29-AUG-2000 



DEFINITION 
ACCESSION 
KEYWORDS 
SOURCE 



FEATURES 
frag 



frag 



frag 



frag 



Location/Qualifiers 
10. .730 

/note="l to 721 of hHMFGllight chain" 
10.. 730 

/note«"l to 72 of 1041inker" 
join(10. .>63,<65. .81) 
/note^'^l to 72 of 1031inker [Split]" 
joindO. .>60,<61. .>63, <65. . 81) 
/note="l to 78 of 1021inker [Split]" 



BASE COUNT 
ORIGIN 



198 A 



208 C 184 G 140 T 0 OTHER 



1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCGACA TCCAGATGAC CCAGAGCCCA AGCAGCCTGA GCGCCAGCGT GGGTGACAGA 
121 GTGACCATCA CCTGTAAGTC CAGTCAGAGC CTTTTATATA GTAGCAATCA AAAGATCTAC 
181 TTGGCCTGGT ACCAGCAGAA GCCAGGTAAG GCTCCAAAGC TGCTGATCTA CTGGGCATCC 
241 ACTAGGGAAT CTGGTGTGCC AAGCAGATTC AGCGGTAGCG GTAGCGGTAC CGACTTCACC 
301 TTCACCATCA GCAGCCTCCA GCCAGAGGAC ATCGCCACCT ACTACTGCCA GCAATATTAT 
361 AGATATCCTC GGACGTTCGG CCAAGGGACC AAGGTGGAAA TCAAACGAAC TGTGGCTGCA 
421 CCATCTGTCT TCATCTTCCC GCCATCTGAT GAGCAGTTGA AATCTGGAAC TGCCTCTGTT 
481 GTGTGCCTGC TGAATAACTT CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC 
541 GCCCTCCAAT CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 
601 TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA CAAAGTCTAC 
661 GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA CAAAGAGCTT CAACAGGGGA 
721 GAGTGTTAGA 




wo 01/74905 



PCT/GBOl/01324 



6/113 



HMFG-1 light chain with Vnp Leader (shaded) 



^^^^^^S^^^IQMTQSPSSLSASVGDRVnTCKSSQSL 

LYSSNQKmAWQQKPGKAPKLLIWASTR^ 

DFTraSSU3reDIATYY(XK?YYRYPRTFGQGTKVEIKRTVAAPSVn 

FPPSDEQLKSGTASWCLLNNFYPREAKVQWKVDNALQSGNSQESV 

TEQDSKDSTYSI^STLTLSKADYEKHKVYAOEVTHQGLSSPVTKSFN 

RGEG 



Fig. 3(C) 
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pAS6 - heavy chain 



LOCUS HHMFGIHC.D 1404 bp DNA 14-AD6-1998 

DEFINITION HUMRMISED HMFGl heavy chain 

ACCESSION HHMFGIH 

KEYWORDS 

SOURCE 

ORGANISM 
REFERENCE 

AUTHORS VERflOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFGl etc 

JOURNAL IMMUNOL. (1993) :1Q, 364-370 

COMMENT VH domain SCANNED IN FROM JOURNAL 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent lautation in all clones 

SITES Note 
BASE COUNT 333 a 439 c 379 g 253 t 

ORIGIN ? 

^ LEADER 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCjCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTAC^IC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GAC^IGTCACT ACSAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTC^AGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GC:ACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 TUVCGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 

TCCTGGGGGG ACCGTCAGTC 
CCCGGACCCC TGAGGTCa^CA 

841 TGCGTGGTGG TGGACGTGAG CCSlCGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCrPCrPGCACA ACCACTACAC GCSWSAAGACSC 
1381 CTCTCCCTGT CTCCGGGTAA 



ul gacaaaactc auaca|i'(3 ccc accgtgccca gcacctgaat 
781 ttcctcttcc ccccaaaacc caaggagacc ctcatgatct 



. ATGAI 



Antibody DNase Fusions Made Here 
(eg pAS34 39,) 



^ End of lower hinge region of heavy chain. RAPE Amino 
Acid Seq. Fab '2 fusions were made at this point. 

Those with HYBRID HINGES are altered further 
up /I.e. 



^ This part GACAAAACTGACACA 
D K T H T 

After this sequence you get the HYBRID HINGE + LINKER SEQUENCES 
Then DNAse I (eg Fab-DNase construct pAS302} 



Fig. 3(D) 
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PAS23 

LOCUS PAS23.DNA 1554 bp mRNA PRI 06-MAR-1995 

DEFINITION Htunaniseci HMFGl Fab '2 fused to hiiman DNase I (construct 1) 
ACCESSION 
NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chorda tia; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shakes., Capon, D. J., Hellmiss,R. , Marsters, S.A. and Baker, C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic 
fibrosis • 

sputum 

JOURNAL Proc. Natl, Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 

MEDLINE 91067672 
BASE COUNT 344 a 468 c 434 g 308 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACXTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA 7\AGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGA A6 GG7VGCGGCGG G CTGAAGATC 
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCTAT 
961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC 
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA 
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC 
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GTGA 



// 



Fig. 5(A) 
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LOCUS FDDNASE23_ 1554 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

• frag join(l. .>720,<787 . ,1554) 

/note-"l to 1554 of 23.dna [Split]" 
frag 721.. 78 6 

/note="l to 66 of 23/271inker" 
frag join (721. .>735,<736. .786) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 344 A 466 C 435 G 309 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTT6 GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACA2U^CAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGA GG GGAGCGGCGG G CTGAAGATC 
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCTAT 
i 961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC 
1321 ATCCGCCTGT GGACRAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA 
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC 
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GTGA 

// 



Fig. 5(B) 
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LOCUS FDDNASE23K 1563 BP SS~DNA SYN 29-ADG--2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1563 

/note="l to 1554 of FdDNase23correct'' 
frag join (10 . .>729, <796. .1563) 

/note-="l to 1554 of 23.dna [Split]" 
frag 730.. 795 

/note="l to 66 of 23/271ink:er" 
frag join(730. .>744,<745. .795) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 345 A 472 C 437 G 309 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAAGACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 7UUVTCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG GAGCGGCGGG 
781 CTGAAGATCG CAGCCTTCAA CATCCAGACA TTTGGGGAGA CCAAGATGTC CAATGCCACC 
I 841 CTCGTCAGCT ACATTGTGCA GATCCTGAGC CGCTACGACA TCGCCCTGGT CCAGGAGGTC 
901 AGAGACAGCC ACCTGACTGC CGTGGGGAAG CTGCTGGACA ACCTCAATCA GGACGCACCA 
961 GACACCTATC ACTACGTGGT CAGTGAGCCA CTGGGACGGA ACAGCTATAA GGAGCGCTAC 
1021 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT GCGGTGGACA GCTACTACTA CGATGATGGC 
1081 TGCGAGCCCT GCGGGAACGA CACCTTCAAC CGAGAGCCAG CCATTGTCAG GTTCTTCTCC 
1141 CGGTTCACAG AGGTCAGGGA GTTTGCCATT GTTCCCCTGC ATGCGGCCCC GGGGGACGCA 
1201 GTAGCCGAGA TCGACGCTCT CTATGACGTC TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
1261 GAGGACGTCA TGTTGATGGG CGACTTCAAT GCGGGCTGCA GCTATGTGAG ACCCTCCCAG 
1321 TGGTCATCCA TCCGCCTGTG GACAAGCCCC ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
1381 GACACCACAG CTACACCCAC GCACTGTGCC TATGACAGGA TCGTGGTTGC AGGGATGCTG 
1441 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT CTTCCCTTTA ACTTCCAGGC TGCCTATGGC 
1501 CTGAGTGACC AACTGGCCCA AGCCATCAGT GACCACTATC CAGTGGAGGT GATGCTGAAG 
1561 TGA 

// 



Fig. 5(C) 
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9 IB 27 • 36 45 54 

5' ATG GGA TGG.AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA OCT ACA GGT GTC CAC 

MGWSCIILFL VATATGV H 

* 63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAQ TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



il7 126 • 135 144 153 162. 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAG ACC TTC AGT GCC TAC TGG ATA GAG 



K . A S 



W 



171" 180 189 198 207 216 

TGGGTGCGCCAGGCTCCAGGAAAGGGCCTC GAGTQG GTC GGA GAG ATT TTA CCT 



W 



KG 'liEWVGE 



225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTMTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDF AW FAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

W G Q 6- T LVTVSSASTKG .PS 

441 450 459 468 477 486 

GTC TTC CCC CTG .GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



495 504 513 522 531 540 

GGC TGC. CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



Fig. 5(D) 
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603 612 621 630 639 . 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TtC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 . 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TY ICWVNHKPSNTKV. DKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCDKTHTCPPCPAP 

765 774 783 792 . 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 



EG SGGLKIAAFWIQTFGE 

819 828 837 846 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC- TAC ATT GTG CAG ATC CTG AGC CGC 



TK MSNATLVSYIVQILSR 

873 . 882 891 . 900 909 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG 



YD lALVQEVRD S H L .TAVG 

927 936 945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 



.KL LD N .L NQOA P D T Y H Y V V 

981 990 999 1008 1017 1026 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 



SE PLGRNSY k'ERYLFVYR 

1035 • 1044 1053 1062 1071 1080 

CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC 



PDQVSAVDSYY YDDGCE.P 

1089 1098 1107 1116 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 



CG NDT'FNREPA I V R F F S R 

1143 1152 1161 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC 

PTEVREFAIVPLHAAPGD 
1197 1206 1215 1224 1233 1242 

Fig. 5(D) 
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GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA 
AVAEIDALYDVYLDV QEK 

1251 1260 1269 1278 1287 1296 

TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT 

WGLEDVMLMGDFN. AGCSY 

1305 1314 . 1323 1332 1341 1350 

GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG 

VRPSQWSSIRLWTSPTFQ 

1359 1368 1377 1386 1395 1404 

TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT 

WliIPDSADTTATPTHCAY. 

1413 1422 1431 * 1440 1449 1458 

GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC C6A GGG GCC GTT GTT CCC GAC TOG 

DRIVVAGMLLRGAVVPDS 

1467 1476 1485 1494 1503 1512 

\ GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 

A L P F ■ N. FQAAYGLSDQL AQ 

1521 1530 1539 1548 

GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

AISD HYPVEVMLK* 

Fig. 5(D) 
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PAS27 



mRNA PRI 
fused to human DNase I 



06-MAR-1995 
with SV40 



sequence is from assembled oligos (thus modified c/f 



LOCUS PAS27.DNA 1584 bp 

DEFINITION Humanised HMFGl Fab '2 
NLS (construct 1) 
ACCESSION 
NID . 

KEYWORDS DNase I, 
SOURCE DNase I 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
ADTHORS Shakes., Capon, D. J., Hellmiss,R., Marsters, S .A. and Baker, C.L. 
TITLE Recombinant hiiman DNase I reduces the viscosity of cystic fibrosis 
sputum 

JOURNAL Proc. Natl. Acad- Sci, U.S.A. 87 (23), 9188-9192 (1990) 

MEDLINE 91067672 
BASE COUNT 354 a 474 c 446 g 310 t 

ORIGIN 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC 

181 GGA7AGGGCC TCGAGTGGGT CGGAGAGATT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG 

\ 421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 

i 481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC 

[ 541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC 

661 AACGTGAATC ACAAGCCCAG CAACACCAAG 

721 GACAAAACTC ACACATGCCC ACCGTGCCCA 

781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG 

841 TACATTGTGC AGATCCTGAG CCGCTACGAC 

901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC 

961 CACTACGTGG TCAGTGAGCC ACTGGGACGG 

1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC 

1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA 

1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG 

1201 ATCGACGCTC TCTATGACGT CTACCTGGAT 

1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC 

1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG 

1381 GCTACACCCA CGCACTGTGC CTATGACAGG 

1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT 

1501 CAACTGGCCC AAGCCATCAG TGACCACTAT 

1561 CCCJMAA&gA AGCGCAAGGT TTGA 



GTAGCAACAG CTACAGGTGT CCACTCCCAG 
AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
TTACCTGGAA GTAATAATTC TAGATACAAT 
AGAGACACAT CCACAAACAC AGCCTACATG 
GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
CCCGAACCGG TGACGGTGTC GTGGAACTCA 
CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
AGCAGCTTGG GCACCCAGAC CTACATCTGC 
GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
GCACCTGA AG GGAGCGSCGG GC TGAAGATC 
ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
AACCTCAATC AGGACGCACC AGACACCTAT 
AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
AGCTACTACT ACGATCS^ATGG CTGCGAGCCC 
(3CCATTGTCA GGTTCTTCTC CCGGTTCIACA 
CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
AGCTATGTGA GACCCTCCCA GTGGTCATCC 
TGGCTGATCC CCGACAGCGC TGACACCACA 
ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
AACTTCCAGG CTGCCTATGG CCTGAGTGAC 
CCAGTGGAGG TGATGCTGAA GGGGGGCGGR. 



Fig. 6(A) 
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LOCUS FDDUASE27_ 1584 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<787. .1584) 

/note="l to 1584 of 27.dna [Split]" 
frag 721,. 78 6 

/note="l to 66 of 23/271inker" 
frag join(721. ->735,<736. .786) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 354 A 472 C 447 G 311 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGG6CC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GGAGCGGCGG GCTGAAGATC 
781 GCAGCCTTCA ACATCCAGAC ATTTGGGGAG ACCAAGATGT CCAATGCCAC CCTCGTCAGC 
841 TACATTGTGC AGATCCTGAG CCGCTACGAC ATCGCCCTGG TCCAGGAGGT CAGAGACAGC 
901 CACCTGACTG CCGTGGGGAA GCTGCTGGAC AACCTCAATC AGGACGCACC AGACACCTAT 
961 CACTACGTGG TCAGTGAGCC ACTGGGACGG AACAGCTATA AGGAGCGCTA CCTGTTCGTG 
• 1021 TACAGGCCTG ACCAGGTGTC TGCGGTGGAC AGCTACTACT ACGATGATGG CTGCGAGCCC 
1081 TGCGGGAACG ACACCTTCAA CCGAGAGCCA GCCATTGTCA GGTTCTTCTC CCGGTTCACA 
1141 GAGGTCAGGG AGTTTGCCAT TGTTCCCCTG CATGCGGCCC CGGGGGACGC AGTAGCCGAG 
1201 ATCGACGCTC TCTATGACGT CTACCTGGAT GTCCAAGAGA AATGGGGCTT GGAGGACGTC 
1261 ATGTTGATGG GCGACTTCAA TGCGGGCTGC AGCTATGTGA GACCCTCCCA GTGGTCATCC 
1321 ATCCGCCTGT GGACAAGCCC CACCTTCCAG TGGCTGATCC CCGACAGCGC TGACACCACA 
1381 GCTACACCCA CGCACTGTGC CTATGACAGG ATCGTGGTTG CAGGGATGCT GCTCCGAGGG 
1441 GCCGTTGTTC CCGACTCGGC TCTTCCCTTT AACTTCCAGG CTGCCTATGG CCTGAGTGAC 
1501 CAACTGGCCC AAGCCATCAG TGACCACTAT CCAGTGGAGG TGATGCTGAA GGGGGGC6GA 
1561 CCCAAAAAGA AGCGCAAGGT TTGA 

// 
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LOCUS FDDNASE27K 1593 BP SS-DNA SYN 29-AOG--2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1593 

/note="l to 1584 of FdDNase2 7 correct" 
frag join(10. .>729,<796. .1593) 

/note="l to 1584 of 27.dna [Split]" 
frag 730..795 

/note«"l to 66 of 23/271inker" 
frag join(730. .>744,<745. .795) 

/note«"l to 78 of 1021inJcer [Split]" 
BASE COUNT 355 A 478 C 449 G 311 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA* 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTqCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG GAGCGGCGGG 
I 781 CTGAAGATCG CAGCCTTCAA CATCCAGACA TTTGGGGAGA CCAAGATGTC CAATGCCACC 

841 CTCGTCAGCT ACATTGTGCA GATCCTGAGC CGCTACGACA TCGCCCTGGT CCAGGAGGTC 
901 AGAGACAGCC ACCTGACTGC CGTGGGGAAG CTGCTGGACA ACCTCAATCA GGACGCACCA 
961 GACACCTATC ACTACGTGGT CAGTGAGCCA CTGGGACGGA ACAGCTATAA GGAGCGCTAC 
1021 CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT GCGGTGGACA GCTACTACTA CGATGATGGC 
1081 TGCGAGCCCT GCGGGAACGA CACCTTC7AC CGAGAGCCAG CCATTGTCAG GTTCTTCTCC 
1141 CGGTTCACAG AGGTCAGGGA GTTTGCCATT GTTCCCCTGC ATGCGGCCCC GGGGGACGCA 
1201 GTAGCCGAGA TCGACGCTCT CTATGACGTC TACCTGGATG TCCAAGAGAA ATGGGGCTTG 
1261 GAGGACGTCA TGTTGATGGG CGACTTCAAT GCGGGCTGCA GCTATGTGAG ACCCTCCCAG 
1321 TGGTCATCCA TCCGCCTGTG GACAAGCCCC ACCTTCCAGT GGCTGATCCC CGACAGCGCT 
1381 GACACCACAG CTACACCCAC GCACTGTGCC TATGACAGGA TCGTGGTTGC AGGGATGCTG 
1441 CTCCGAGGGG CCGTTGTTCC CGACTCGGCT CTTCCCTTTA ACTTCCAGGC TGCCTATGGC 
1501 CTGAGTGACC AACTGGCCCA AGCCATCAGT GACCACTATC CAGTGGAGGT GATGCTGAAg 
1561 GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT T6A 

II 
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9 18, 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



mg'wsci i lf'lvatat. GVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQLVQSGAEV kKPGAS 

117 ' 126 - 135 144 153 162 

GTC AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VKVSCKASGYTFSAYWIE 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WVRQ APGKGLEWVG E I LP 

225 234 243 252 261 270 

GGk AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



GSNNSRY NEKFKGRVTVT 

279 .288 297 306 315 324 

AGA GAC ACA TCC ACA ARC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



RDTSTNTAYMELSSLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT* GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



DTAVYYCARSYDFA. WFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG' GGC CCA TCG 



WGQGTLVTVSSASTKG P'S 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VFPLAPSSKSTSGGTAAI. 

495 . 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG* AAC TCA 



GCLVKDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQSSG 

Fig, 6(D) 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSS -VVTVPSSSLGTQ 

657 666 675 684 693 . 702 

ACC TAC ATC TCC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAG AAG AAA 

TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG Ca: AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG 



EGSGGLKIAAFNIQTFGE 

819 828 837 846 855 864 

ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC 

TKKSNATLVSYIVQILSR 

873 882 891 900 909 918 

TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG 



YDIALVQEVRDS HLTA VG 

927 936 945 954 963 972 

AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC 



KLLDNLNQDAPDTYHYVV 

961 990 999 1008 1017 1026 

AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG 



SE.PLGRNSYKER YLFVYR 

1035 1044 1053 1062 • 1071 1080 

OCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC 



PDQVSAVDSYY YDDGC EP 

1089 1098 1107 1116 1125 1134 

TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG 

CGNDTFNREPAIVRFF SR 

1143 1152 1161 1170 1179 1188 

TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC 

FTEVRE FAIVPLHAAPGD 
1197 1206 1215 1224 1233 1242 
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GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA 

AVAE I DALYDVYLDVQ E*K 

1251 1260 1269 1278 1287 1296 

TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT 

WGLEDVMLMG DFNAGCSY 

1305 1314 1323 1332 1341 1350 

GTG AGA CCC 'TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC.ACC TTCCAG 

VRPSQWSSIRLWTS PTFQ 

1359 1368 1377 1386 • 1395 1404 

TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC T6T GCC TAT 

WLIPDSADTTATPTHCAY 

1413 1422 1431 1440 1449 1458 

GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG 

DRIVVAGMLLRGAVVP DS 

1467 1476 1485 1494 1503 1512 

GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA 

ALPFNFQAA YGLSDQL AQ 

• 1521 1530 1539 1548 1557 1566 

GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA 

AISDHYP.VEVMLK G G G P K 

1575 1584 
AAG AAG CGC AAG GTT TGA 3' 

K K R K V * 
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PAS34 



IiOCDS 

DEFINITION 
DEFINITION 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
COMMENT 



PAS34.DNA 2196 bp 2196 bp 2196 bp DNA 14-AUG-1998 

HUMANISED HMFGl heavy chain fused to human DNAse construct 34 
Clone 16.4.2 (same as hcdnasel.dna template file) 



VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFGl etc 
IMMUNOL. (1993) :78, 364-370 

Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS79 and AS80 
FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 
FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 501 a 677 c 607 g 411 t 

ORIGIN ? 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CT<3GCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTG^G CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCT^GGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGGGTAA AGGGAGCGGC GGG CTGAAGA TCGCAGCCTT CAACATCCAG 
1441 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG 
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG 
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTCSAG 
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG 
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC 
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC 
1801 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGATCGACGC TCTCTATGAC 
1861 GTCTACCTGG ATGTCCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC 
1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 
2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG 
2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACTGGC CCAAGCCATC 
2161 AGTGACCACT ATCCAGTGGA GGTGATGCTG AAGTGA 
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9 18 27 36 45 54 

5' ATG GGA TGG ACC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

K G WSCIIl^FLVATATGV H 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GT6 AAA AAG CCT GGG GCC TCA 

^ S QVQLVQSGAEVKKPGA S 

117 126 135 144 153 162 

GTC AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC ACT GCC TAC TGG ATA GAG 

V KVSCKASGYTFSAYWI.E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC G(5A GAG ATT TTA CCT 

WVRQA. PGKGLEWVGEIL P 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSN NSRYNEKF KGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC .ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNT.AYMELS.SLRS E 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

\ WGQ6TLVTVSSASTKGPS 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAI. 

495 504 513 522 531 . 540 * 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLV KDYFPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALT SGVHTFPAVL QS s'G 

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSL SSVVTVPS. SSLGTQ 

657 666 675 684 693 702 

Fig. 7(B) 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAG AAG AAA 



T V I c N V.N H K P S NT K V D K K 



GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGG P SVFLF PPK P K DT 

819 828 837 846 655 864 

CTC ATG ATC TCC CCG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMI SRT PEVTCVVV DVS H 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVDGVEVHK 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CG6 GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTK PR EEQyNSTYRVV S 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CXC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLHQDWLWGKEYKC K 

1035 1044 1053 i062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC. GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKAL PA PI EKTI S KA K 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQPREPQVYTLP PSRDE.L 

1X43 .1152 . 1161 1170 .1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLTCLVKGFYPSD 

1197 1206 1215 1224 . 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

lAVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTG TAC AGC AAG CTC ACC GTG 

PP V L DS'DG S FPLYS K L T V 

1305 1314 ' 1323 . 1332 1341 1350 

GAC AAG AGC ACG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

OK S R WQ QG NVFSC S V M H E 



711 



720 



729 



756 
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GCT 


1359 
CTG CAC AAC 


1368 
CAC TAC 


ACG 


1377 
CAG AAG 


AGC 


1386 
CTC TCC 


CTG 


1395 1404 
TCT CCG GGT AAA GGG 


A 


L H 


N 


H y 


T 


Q K 


S 


L S 


L 


S P 


G 


K G 


AGC 


1413 
GGC GGG 


CTG 


1422 
AAG ATC 


GCA 


1431 
GCC TTC 


AAC 


1440 
ATC CAG 


ACA 


1449 
TTT GGG 


GAG 


1458 
ACC AAG 


S 


G G 


L 


K I 


A 


A F 


N 


I Q 


T 


F G 


E 


T K 


ATG 


1467 
TCC AAT 


GCC 


1476 
ACC CTC 


GTC 


1485 
AGC TAC 


ATT 


1494 
GTG CAG 


ATC 


1503 
CTG AGC 


CGC 


1512 
TAC GAC 


H 


S H 


A 


T L 


V 


s * y 


I 


V Q 


I 


L S 


R 


Y D 


ATC 


1521 
GCC CTG 


GTC 


1530 
CAG GAG 


GTC 


1539 
AGA GAC 


AGC 


1548 
CAC CTG 


ACT 


1557 
GCC GTG 


GGG 


1566 
AAG CTG 


I 


A L 


V 


Q E 


V 


R D 


S 


H L 


T 


A V 


G 


K L 


CTG 


1575 
GAC AAC 


CTC 


1584 
AAT CAG 


GAC 


1593 
GCA CCA 


GAC 


1602 
ACC TAT 


1611 
CAC TAC GTG 


GTC 


1620 
AGT GAG 


L 


D N 


h 


N Q 


D 


A P 


D 


T y 


H 


Y V 


V 


S B 


CCA 


1629 
CTG GGA 


CGG 


1638 
AAC AGC 


TAT 


1647 1656 1665 1674 
AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 


P 


L G 


R 


W S 


y 


K E 


R 


Y L 


P 


V y 


R 


P D 


CAG 


1683 
GTG TCT 


GCG 


1692 
GTG GAC 


AGC 


1701 1710 1719 1728 
TAC TAC TAC GAT GAT GGC TGC GAG CCC TCC GGG 


Q 


V S 


A 


V D 


s 


y Y 


Y 


D D 


G 


C E 


P 


C G 


AAC 


1737 
GAC ACC 


TTC 


1746 
AAC CGA 


GAG 


1755 
CCA GCC 


ATT 


1764 
GTC AGG 


TTC 


1773 
TTC TCC 


CGG 


. 1782 
TTC ACA 


N 


D T 


F 


N R 


E 


P A 


I 


V R 


F 


F S 


R 


F T 


GAG 


1791 
GTC AGG 


GAG 


1800 
TTT GCC ATT 


1809 
GTT CCC 


CTG 


1818 
CAT GCG 


GCC 


1827 
CCG GGG 


GAC 


1836 
GCA GTA 


E 


V R 


£ 


F A 


I 


V P 


I. 


H A 


A 


P G 


D 


A V 


GCC 


1845 
GAG ATC 


GAC 


1854 
GCT CTC 


TAT 


1863 
GAC GTC 


TAC 


1872 
CTG GAT GTC 


1881 
CAA GAG 


AAA 


1890 
TGG GGC 


A 


E I 


D 


A L 


Y 


D V 


y 


L D 


V 


Q E 


K 


W G 


TTG 


1899 
GAG GAC 


GTC 


1908 
ATG TTG 


ATG 


1917 
GGC GAC 


TTC 


1926 
AAT GCG 


GGC 


1935 
TGC AGC 


1944 
TAT GTG AGA 


L 


E D 


V 


M h 


K 


G D 


F 


K A 


G 


C S 


y 


V R 


ccc 


1953 
TCC CAG 


TGG 


1962 
TCA TCC 


ATC 


1971 
CGC CTG 


TOG 


1980 
ACA AGC 


CCC 


1989 
ACC TTC 


CAG 


1998 
TGG CTG 


p 


S 0 


W 


S S 


I 


R L 


H 


T S 


p 


T F 


Q 


W L 


ATC 


2007 
CCC GAC 


AGC 


2016 
GCT GAC 


ACC 


2025 
ACA GCT ACA 
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2061 2070 2079 2088 2097 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVA G M L L R GAV V P D S A L 

2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

P FNFQAAYGLSDQLAQAI- 

2169 2178 • 2187 2196 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

SDHYPVEVMI/K* 
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PAS35 



LOCUS PAS35.DNA 2193 bp 2193 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFGl heavy chain fused to human DNAse construct 35 

DEFINITION Clone 17.12.1 with silent K to K mutation (1398 A > G) 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFGl etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS81 and AS82 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES In 17.12.1 residue 1398 is A > G (silent K to K mutation) 

SITES Note 
BASE COUNT 500 a 677 c 606 g 410 t 

ORIGIN ? 

1 ATGGGATG6A GCTGTATCAT CCTCTTCTT6 GTAGCAACA6 CTACAGGTGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
^ 961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT r^rndAA aGG S&gCSGCGGG CTGAAGATCG CAGCCTTCAA CATCCAGACA 
1441 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 
1561 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621 CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
1801 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC 
1861 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC 
2041 TATGACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GACCACTATC CAGTGGAGGT GATGCTGAAG TGA 
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9 18 27 36 45 54 

5' ATC GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC O^.C 
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TCC CAG GTG CAG CTG GT6 CAG" TCT GGG GCA GAG* GTG AAA AAG CCT GGG GCC TCA 
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GAC ACA GCC GTC *AT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCA RSYDFA WF AY 

367 396 405 414 • 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 
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S 



450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 
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K 



495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



wui.vt\>uxt PEPVTVSWNS 
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GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 
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ACC TAX ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNVNHK PSNTKVDKK 

111 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

\/EPKSCDKTHTCPP CP AP 

765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

BLLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMISRTPEVTCVVV DV SH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAG GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVOGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 

AKTKPREEQYNSTY RVVS 

981 950 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLHQDWLNGKE YK CK 

1035 1044 1053 1062 1071 1060 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAFIEKTI SKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

/ GQPREPQVYTLFPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLTCLVKGF YP SD 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

lAVEWESNGQPENNYKTT 

i251 1260 1269 1278 1287 1296 

CCT CCC CTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PPVL-OSOGSFPLYS KLTV 

1305 1314 1323 1332 1341 1350 

CAC AAG AGC AGG TGG CAC CAG GGG AAC GTC TTC TCA TGC TCC GTC ATG CAT GAG 

DKSRWQQGNVFSCS VMHE 
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1359 1368 1377 1386 1395 ' 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG AAG GGG AGC 



ALHNHYTQKSLSLSPK G S 

1413 1422 1431 1440 1449 1458 

GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG AGO AAG ATG 



G G_ LK I AAFN IQTFG ET KM 



1467 1476 1485 1494 1503 • 1512 

TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC 



S NATLVSY IV QILSRYDI 

1521 1530 1539 1548 1557 1566 

GCC- CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 



ALVQEVRDSHLTAVGKLL 

1575 1584 1593 1602 1611 1620 

GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG OCA 



DNLNQDAPDTYHYVV5EP 

1629 ' 1638 1647 1656 1665 1674 

CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG 



LGRNS YKERYLFVYRPDQ 

1683 1692 1701 1710 1719 1728 

GTG TCT GCG GTG GAC AGC TAC TAC .TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC 



VSAVOSYYYDDGCEPCqN 

1737 1746 1755 1764 1773 1782 

GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG 



OTFNREPAIVRFFSR FTE 

1791 1800 1809 1818 1827 1836 

GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC 



VREFA IVPLHAAPGDAVA 

1845 1854 1863 1872 1881 1890 

GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG 



EIDALYDVYLDVQEKWG L 

1899 1908 1917 1926 1935 1944 

GAG GAC CTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC 



EDV M L MG D FN A G C S Y V R P 

1953 1962 1971 1980 1989 1998 

TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC 

SQV/SS IRLWTS PTFQW LI 

2007 2016 2025 2034 2043 2052 

CCC GAC ACC CCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC 



P D SADTTATPTHCAYDRl 
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2061 2070 2079 2088 2097 2106 

GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG OCT CTT CCC 

V-VAGMLLRGAVVPDSALP 

2115 2124 2133 2142 2151 2160 

TTT AAC TTC GAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 

FN FQAAYGLSDQLAQAIS 

2169 2178 2187 

GAC CAC TAT CCA GTG'GAG GTG ATG CTG AAG TGA 3' 

DHYPVEVMLK* 
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pAS36 



PAS36.DNA 2190 bp 2190 bp 



DNA 



14^AUG- 



HUMANISED HMFGl heavy chain fused to human DNAse - construct 36 
Clone 18.24.1 with residue 1392 T > C 



VERHOEYEN ET AL 

CONSTRUCTION OF RESHAPED HMFGl etc 
IMMUNOL. (1993) :78, 364-370 
Human DNAse sequence is modified as 



a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS83 and AS84 
FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 
FEATURES Residue 963 is G > T leading to silent mutation in all clones 
FEATURES Residue 1392 T > C silent S to S mutation 

SITES Note 
BASE COUNT 498 a 678 c 605 g 409 t 

ORIGIN ? 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTT6 GTAGCAACAG CTACAGGTGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG T.GACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGG CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CcCC GGGGAG CGGCGGG CTG AAGATCGCAG CCTTCAACAT CCAGACATTT 
1441 GGGGAGACCA AGATGTCCAA TGCCACCCTC GTCAGCTACA TTGTGCAGAT CCTGAGCCGC 
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG 
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG 
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG 
1681 GTGGACAGCT ACTACTACGA TGATGGCTGC GAGCCCTGCG GGAACGACAC CTTCAACCGA 
1741 GAGCCAGCCA TTGTCAG6TT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT 
1801 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC 
1861 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG 
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2041 GACAGGATCG TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGTGACCAAC TGGCCCAAGC CATCAGTGAC 
2161 CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 

TYICNV KHKPSNTKVDKK 

711 720 729 73B 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VE .PK SCDKTHTCPPC P A P 

765 774 783 792 ' 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

E LLGGPSVFLFPPKPKDT 

819 828 837 846 855 864 

CTC ATG ATC TCC CGG ACC. CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LM IS RT P EVTC V VV D V S H 

873 882 891 900 909 ' 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYVDGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACQ TAC CGT GTG GTC AGC 

AKTKP .REEQYNSTYRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLHQDWLNGKEYKCK 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKALPAPIEKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC CGA GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQPRBPQV.YTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

ACC AAG AAC CAG GTC AGC CTG ACC TGC CTG GTC AAA GGC TTC TAT CCC AGC GAC 

TKNQVSLTCLVKGF .Y P SD 

1197 1206 1215 1224 1233 1242. 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

lAVEWESNGQPENNYKTT 

125X 1260 1269 1278 1287 1296 

CCT CCC GTG CTG GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PPVLDSDGSFFLYSKLTV 

1305 1314 1323. 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

OKSRWQQGNVFSCSVMHE 
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1359 



1368 



1377 



1386 



1395 



1404 



GCT CTG CAC AAC CAC TAG ACG CAG AAG AGC CTC TCC CTG TCC CCG GGG AGC GGC 

ALHNHYTQKS LSLSP G S G 

1413 1422 1431 1440 1449 1458 

GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG TCC 

G L KIAAFNIQTFGETKMS 

1467 1476 1485 1494 1S03 1512 

aat gcc acc ctc gtc agc tac att gtg cag atc ctg agc ogc tac gac atc gcc 
natlVsyivqzlsrydia 

1521 1530 1539 1548 1557 1566 

CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG GAC 

LVQBVRDSHLTAV 6KLLD 

1575 1584 1593 1602 1611 1620 

AAC CTC AAT CAG GAD GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA CTG 

NLNQDA PD TYHYVVS E P L 

1629 1638 1647 1656 1665 1674 

GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC A6G OCT GAC CAG GTG 

GRNSYKERYLFVYRPDQV 

1683 1692 1701 1710 1719 1728 

TCT 6CG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG AAC GAC 

SAVDSYYYDDGCEPCG MD 

1737 1746 1755 1764 1773 1782 

ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 

TFNREPAIVRFFSRFTEV 

1791 XSTTO 1809 -I'SIB 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 

RBFAIVPLHAAPGDAVAE 

1845 1B54 1863 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG. GAT GTC CAA GAG AAA TGG GGC TTG GAG 

IDALYD VYLDVQEKWG L E 

1899 1908 1917 1926 1935 1944 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC 

DVKLMGDF NAG C SY.V R P S 

1953 1962 1971 1980 1989 1998 

CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG ATC CCC 

QWSSIRLWTSP .TFQWLI P 

2007 2016 2025 2034 2043 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 

DSADTTATPTHCAYDR I V 
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2061 2070 2079 2088 2097 2106 

GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC TTT 

VAGMLLRGAVVPpSALP F 

2115 2124 2133 2142 2151 2160 

AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT GAC 

NFQA AYGLSDQLAQAIS D 

2169 2178 2187 

CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

HYPVEVMLK* 



Fig. 9(B) 
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PAS37 

LOCUS PAS37.DNA 2226 bp 2196 bp 2196 bp DNA 14-AUG- 

1998 

DEFINITION HUMANISED HMFGl heavy chain fused to hiaman DNAse construct 37 
DEFINITION Clone 16.4.2 (same as hcdnasel.dna teir^late file) plus NLS 
REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFGl etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 
COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS79 and AS80 
FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 
FEATURES Residue 963 is G > T leading to silent mutation in all clones 

SITES Note 
BASE COUNT 511 a 683 c 619 g 413 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACtCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGGGTAA AGGGAGCGGC GGG CTGAAGA TCGCAGCCTT CAACATCCAG 
1441 ACATTTGGGG AGACCAAGAT GTCCAATGCC ACCCTCGTCA GCTACATTGT GCAGATCCTG 
1501 AGCCGCTACG ACATCGCCCT GGTCCAGGAG GTCAGAGACA GCCACCTGAC TGCCGTGGGG 
1561 AAGCTGCTGG ACAACCTCAA TCAGGACGCA CCAGACACCT ATCACTACGT GGTCAGTGAG 
1621 CCACTGGGAC GGAACAGCTA TAAGGAGCGC TACCTGTTCG TGTACAGGCC TGACCAGGTG 
1681 TCTGCGGTGG ACAGCTACTA CTACGATGAT GGCTGCGAGC CCTGCGGGAA CGACACCTTC 
1741 AACCGAGAGC CAGCCATTGT CAGGTTCTTC TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC 
1801 ATTGTTCCCC TGCATGCGGC CCCGGGGGAC GCAGTAGCCG AGATCGACGC TCTCTATGAC 
1861 GTCTACCTGG ATGTCCAAGA GAAATGGGGC TTGGAGGACG TCATGTTGAT GGGCGACTTC 
1921 AATGCGGGCT GCAGCTATGT GAGACCCTCC CAGTGGTCAT CCATCCGCCT GTGGACAAGC 
1981 CCCACCTTCC AGTGGCTGAT CCCCGACAGC GCTGACACCA CAGCTACACC CACGCACTGT 
2041 GCCTATGACA GGATCGTGGT TGCAGGGATG CTGCTCCGAG GGGCCGTTGT TCCCGACTCG 
2101 GCTCTTCCCT TTAACTTCCA GGCTGCCTAT GGCCTGAGTG ACCAACTGGC CCAAGCCATC 
2161 AGTGACCACT ATCCAGTGGA GGTGATGCTG A AGGGGGGCG GACCCAftAAA GRAGCGCa^ 

2221 GTTTGA I 

- I ^NLS 



// 
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9 18 27 36 45 54 

5' ATG C3GA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSCI ILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

Q VQLVQSGAEVKKPGAS 

117 126 135 144 153 16-2 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSC KASGYTFSAYWI E 

171 180 189 1S8 207 21£ 

TGG GTC CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

'WVRQAPGKGLEWVGB IL<P 

^25 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNS RYN EttFKG R V T V T 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMBXSS LRS3 

333 342 351 360 ^ 369 378 

GAC ACA X5CC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFA.Y 

387 396. 405 414 ' 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSASTKG P S 

441 450 ' 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VF P LA P SS KSTSG G T A A L 

495 504 513 522 ^ 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDY FPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTIC CTA CAG TCC TCA GGA 

GALTSGVHTF- PAVtQ.S S C 

603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LY. S L S S VVT V P S S S L G T Q 

657 666 675 634 693 702 

Fig. 10(B) 
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1359 



1368 



1377 



1386 



1395 



1404 



OCT CTG CAC kkC CAC TAG ACC CAG AAG AGC CTC TCC CTG TCT CCG GGT AAA GGC 

ALHNHYTQKSLSLSPGK 

1413 1422 1431 1440 1449 1458 

AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 

S G G LKIAAFNIQTFGETK 

1467 1476 1485 14^4 1503 1512 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 

MSNATLVS YIVQILS RYD 

1521 1530 1539 1548 1557 1566 

ATC GCC CTG GTC CAG GAG GTC ACA GAC AGC CAC CTG ACT GCC GTG GOG AAG CTG 

lALVQEVRDS HLTAVG KL 

1575 1584 1593 1602 1611 1620 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 

I.DNLHQDAPDTYHYVVSE 

1629 1638 1647 1656 1665 1674- 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 

PLGRNSYKBRYLFVYRPD 

.1683 1692 1701 1710 1719 1728 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 

QVSAVDSYYYDDGCEPCG 

1737 1746 1755 1764 1773 1782 

AAC GAC ACC TTC AAC OGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 

NDTF NREPA.IVRFFS RFT 

1791 1800 1809 1818 1827 1836 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 

EVREFAIVPLHAAPGDAV 

L84S 1854 1863 1872 1881 1890 

GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

AEIDALYDVYLDVQEKWG 

1899 1908 1917 1926 1935 1944 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GOG GCC TGC ACC TAT GTG AGA 

LE0VMLMG0FNA6CSYVR 

1953 1962 1971 1980 1989 * 1998 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 

PS.QWSS IRLWTS PTFQWL 

2007 2016 2025 2034 2043 2052 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

IPDSADTTATPTHCAY DR 
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2061 2070 2079 2088 2097 2106 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPDSAL 

2115 2124 2133 2142 2151 2160 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

pp. NFQAAYGLSDQLAQAI 

2169 217 8 2187 2196* 2205 2214 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA AAG AAG 



S D 



HYPVEVMLK G G G P K K K 



2223 

OGC AAG GTT TGA 3' 



R K V 



Fig. 10(B) 
(Sheet 4 of 4) 



wo 01/74905 



PCT/GBOl/01324 



42/113 
DAS38 



LOCOS PAS38.DNA 2223 bp 2193 bp DNA 14-AUG-1998 

DEFINITION HUMANISED HMFGl heavy chain fused to human DNAse construct 38 

DEFINITION Clone 17.12.1 with silent K to K mutation (1398 A > G)+NLS 
REFERENCE 

AUTHORS VERHOEYEN ET KL 

TITLE CONSTRUCTION OF RESHAPED HMFGl etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS81 and AS82 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES In 17.12.1 residue 1398 is A > G (silent K to K mutation) 

SITES Note 
BASE COUNT 510 a 683 c 618 g 412 t 

ORIGIN ? 

1 ATGG(5ATGGA GCTGTATCAT CCTCTTCTTG GTAGGAACA6 CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC . 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACA7UV6C CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
] 1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAT^ GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT CTCCGAA aSG GAGCGSCSGG CTGAAGATCG CAGCCTTCAA CATCCAGACA 
1441 TTTGGGGAGA CCAAGATGTC CAATGCCACC CTCGTCAGCT ACATTGTGCA GATCCTGAGC 
1501 CGCTACGACA TCGCCCTGGT CCAGGAGGTC AGAGACAGCC ACCTGACTGC CGTGGGGAAG 
1561 CTGCTGGACA ACCTCAATCA GGACGCACCA GACACCTATC ACTACGTGGT CAGTGAGCCA 
1621 CTGGGACGGA ACAGCTATAA GGAGCGCTAC CTGTTCGTGT ACAGGCCTGA CCAGGTGTCT 
1681 GCGGTGGACA GCTACTACTA CGATGATGGC TGCGAGCCCT GCGGGAACGA CACCTTCAAC 
1741 CGAGAGCCAG CCATTGTCAG GTTCTTCTCC CGGTTCACAG AGGTCAGGGA GTTTGCCATT 
1801 GTTCCCCTGC ATGCGGCCCC GGGGGACGCA GTAGCCGAGA TCGACGCTCT CTATGACGTC 
1861 TACCTGGATG TCCAAGAGAA ATGGGGCTTG GAGGACGTCA TGTTGATGGG CGACTTCAAT 
1921 GCGGGCTGCA GCTATGTGAG ACCCTCCCAG TGGTCATCCA TCCGCCTGTG GACAAGCCCC 
1981 ACCTTCCAGT GGCTGATCCC CGACAGCGCT GACACCACAG CTACACCCAC GCACTGTGCC 
2041 TATGACAGGA TCGTGGTTGC AGGGATGCTG CTCCGAGGGG CCGTTGTTCC CGACTCGGCT 
2101 CTTCCCTTTA ACTTCCAGGC TGCCTATGGC CTGAGTGACC AACTGGCCCA AGCCATCAGT 
2161 GACCACTATC CAGTGGAGGT GATGCTGA AG GGGGGCGGAC CCAAAAAGAA GCGCAAGGTT 
2221 TGA ===== 



// 



I ^NLS 

Fig. 11(A) 
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ACC TAC ATC TGC AAC GTG AAT CAC AAC CCC AGC AAC ACC AAG GTC GAC AAG AAA 
TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 

VEPKSCDKTHTCPPCPAP 

765 774 7^3 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 

ELLGGPSVFLF PPKPKDT 

819 828 837 846 855 864 

CTt ATG ATC TGC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 

LMISRTPEVTC VVVDVSH 

873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWYV DGVEVHN 

927 936 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC AOG TAC CGT GTG GTC AGC 

AKTKPREEQYN S T.YRVVS 

981 990 999 1008 1017 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 

VLTVLH QDWLN G K E YK C K 

1035 1044 1053 1062 1071 1080 

GTC TCC AAC AAA GCC CTC CCA GCC CCC ATC GAG AAA ACC ATC TCC AAA GCC AAA 

VSNKAL PAPIEKTISKAK 

1089 1098 1107 1116 1125 1134 

GGG CAG CCC 06A GAA CCA CAG GTG TAC ACC CTG CCC CCA TCC CGG GAT GAG CTG 

GQPREPQVYTLPPSRDEL 

1143 1152 1161 1170 1179 1188 

acc aag aac cag gtc agc ctg acc tgc ctg gtc aaa ggc ttc tat ccc agc gac 
tknqv.sl'tclvkgfypsd 

1197 1206 1215 1224 1233 1242 

ATC GCC GTG GAG TGG GAG AGC AAT GGG CAG CCG GAG AAC AAC TAC AAG ACC ACG 

I. AVEWESNGQPENNYKTT 

1251 1260 1269 1278 1287 1296 

CCT CCC GTG CTC GAC TCC GAC GGC TCC TTC TTC CTC TAC AGC AAG CTC ACC GTG 

PPVLDS D G. SF F L YSKLTV 

.1305 1314 1323 • 1332 1341 1350 

GAC AAG AGC AGG TGG CAG CAG GGG AAC GTC TTC TCA TGC TCC GTG ATG CAT GAG 

DKSRWQQGNVFSCSVMH E 
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1359 1368 1377 1386 139S 1404 

GCT CTG CAC AAC CAC TAC ACG CAG AAG AGC CTC TCC CTG TCT CCG AAG GGG AGC 

ALHNHYTQKSL. SLSPK G S 

1413 1422 1431 1440 1449 1458 

GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG ATG 

G G LKIAAFNIQTFGETKM 

1467 1476 • 1485 1494 1503 1512 

TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC ATC 

SNATiVSYlVQILSRYDI 

1521 1530 1539 1548 1557 1566 

GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG 

ALVQEVRDSHLTAVGKLIi 

1575 1584 1593 1602 1611 1620 

GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA 

DNLNQDAPDTYHYVVS E P 
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2061 2070 2079 2088 2097 2106 

GTG GTT GCA GGG ATG CTG CTC CX3A GGG GCC GTT GTT CCC GAC TCG GCT CTT CCC 

VVAGMLLRGAVVPDSAL P 

2115 2124 2133 2142 2151 2160 

TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC AGT 

FNFQAAY -GL-SDQLAQA I S 
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PAS39 



LOCUS PAS39.DNA 2220 bp 2190 bp DNA 14-AUG- 
1998 

DEFINITION HUMANISED HMFGl heavy chain fused to human DNAse - construct 39 

DEFINITION Clone 18.24.1 with residue 1392 T > C +NLS 

REFERENCE 

AUTHORS VERHOEYEN ET AL 

TITLE CONSTRUCTION OF RESHAPED HMFGl etc 

JOURNAL IMMUNOL. (1993) :78, 364-370 

COMMENT Human DNAse sequence is modified as a result of oligo assembly 
(mhdnase.dna) 

COMMENT The fusion was made using overlapping oligos AS83 and AS84 

FEATURES AA RESIDUE 235 HAS NOT BEEN CHANGED TO KABAT (I.E. V TO A) 

FEATURES Residue 963 is G > T leading to silent mutation in all clones 

FEATURES Residue 1392 T > C silent S to S mutation 

SITES Note 
BASE COUNT 508 a 684 c 617 g 411 t 

ORIGIN ? 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAG6TGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGC3G ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 TiACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
781 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
841 TGCGTGGTGG TG(3ACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
901 GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
961 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGTUITGGCAA GGAGTACAAG 
1021 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
1081 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
1141 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
1201 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
1261 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
1321 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 
1381 CTCTCCCTGT nnCn GGGGAG CGGCSGGC TG AAGATCGCAG CCTTCAACAT CCAGAC3^TTT 
1441 GGGGAGACCA AGATGTCCAA TGCCACCCTC GTCAGCTA(:A TTGTGCAGAT CCTGAGCCGC 
1501 TACGACATCG CCCTGGTCCA GGAGGTCAGA GACAGCCACC TGACTGCCGT GGGGAAGCTG 
1561 CTGGACAACC TCAATCAGGA CGCACCAGAC ACCTATCACT ACGTGGTCAG TGAGCCACTG 
1621 GGACGGAACA GCTATAAGGA GCGCTACCTG TTCGTGTACA GGCCTGACCA GGTGTCTGCG 
1681 GTGGACAGCT ACTACTACGA TGATGGCTGC GAGCCCTGCG GGAACGACAC CTTCAACCGA 
1741 GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG TTCACAGAGG TCAGGGAGTT TGCCATTGTT 
1801 CCCCTGCATG CGGCCCCGGG GGACGCAGTA GCCGAGATCG ACGCTCTCTA TGACGTCTAC 
1861 CTGGATGTCC AAGAGAAATG GGGCTTGGAG GACGTCATGT TGATGGGCGA CTTCAATGCG 
1921 GGCTGCAGCT ATGTGAGACC CTCCCAGTGG TCATCCATCC GCCTGTGGAC AAGCCCCACC 
1981 TTCCAGTGGC TGATCCCCGA CAGCGCTGAC ACCACAGCTA CACCCACGCA CTGTGCCTAT 
2041 GACAGGATCG TGGTTGCAGG GATGCTGCTC CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT 
2101 CCCTTTAACT TCCAGGCTGC CTATGGCCTG AGTGACCAAC TGGCCCAAGC CATCAGTGAC 
2161 CACTATCCAG TGGAGGTGAT GCTGA AGGGG GGCGGACCC& A&ftAGaAGCG CAAGGTTTGm 
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ACC TAG ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 
TYICNVNHKPSNTKVDKK 

111 720 729*' 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC- CCA CCG TGC CCA GCA CCT 



765 774 783 792 801 810 

GAA CTC CTG GGG GGA CCG TCA GTC TTC CTC TTC CCC CCA AAA CCC AAG GAC ACC 



819 828 837 • 846 855 864 

CTC ATG ATC TCC CGG ACC CCT GAG GTC ACA TGC GTG GTG GTG GAC GTG AGC CAC 



873 882 891 900 909 918 

GAA GAC CCT GAG GTC AAG TTC AAC TGG TAC GTG GAC GGC GTG GAG GTG CAT AAT 

EDPEVKFNWyVDGVEVHN 

927. 936 • 945 954 963 972 

GCC AAG ACA AAG CCG CGG GAG GAG CAG TAC AAC AGC ACG TAC CGT GTG GTC AGC 
— — — — — — — — — — 

AKTKPREEQYNSTYRVVS 

981' 990 999 1008 X017 ' 1026 

GTC CTC ACC GTC CTG CAC CAG GAC TGG CTG AAT GGC AAG GAG TAC AAG TGC AAG 



1035 1044 1053 1062 1071 1080 
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CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG CTG GAC 
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AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG CCA* CTG 
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.GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC CAG GTG 
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ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA GAG GTC 

TFNREPAIVRFFSRPTEV 

1791 IBOO 1809 1818 1827 1836 

AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA GCC GAG 

REFAIVPLHAAPGDAVAE 

1845 1854 1863. 1872 1881 1890 

ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC TTG GAG 

IDALYDVYLDVQEKWGLE 

1899 1908 1917 1926 1935 1944 

GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA CCC TCC 

DVMLMGDFWAGCSYVRPS 

1953 1962 1971 1980 1989 'l998 

CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG. TGG CTG ATC CCC 

Q WSSIRLWTSPTFQWLl P 

2007 2016 2025 2034 2043 2052 

GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG ATC GTG 

DSAD TTATPTHCAYDRIV 
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2061 2070 2079 20B8 2097 2106 

CTT CCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC' TCG GOT CTT CCC TTT 
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pASlOl 

LOCUS PASIOI.DNA 1548 bp mRNA PRI 06-MaR-1995 

DEFINITION Humanised HMFGl Fab '2 fused to human DNase I (pASlOl) 
ACCESSION 
NID 

KEYWORDS DNase I. . ^ y^r 

SOURCE DNase I sequence is tram assembled oligos (thus modified c/£ 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shakes, r Capon,D.J., Hellmiss,R., Marsters^S.A. and Baker, C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic 
fibrosis 

sputum 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 {1990) 

MEDLINE 91067672 
BASE COUNT 343 a 467 c 430 g 308 t 

ORIGIN 

1 ATGGGA TGGA 6CTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAA6GTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAA£^^^CTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGTGA 
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LOCUS FDDNASElOl 1548 BP SS-DNA SYN 25-AOG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCK 

FEATURES Location/Qualifiers 

frag join (1. .>720,<781. . 1548) 

/note="l to 1548 of PASlOl.dna [Split]" 
frag 721,. 780 

/note="l to 60 of 101/1051inker" 
frag join(721. .>735,<736. .>759,<760. .>780) 

/note="l to 80 of 1021inker [Split]" 
BASE COUNT 343 A 465 C 431 G 309 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACA?VGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GCGGGCTGAA GATCGCAGCC 
.781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
j . 841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
.1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 
. 1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
/ 1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGTGA 

// 



Fig. 13(B) 
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LOCUS FDDNASElbl 1557 BP SS-DNA SYN 29-MrG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag " 10. .1557 

/note="l to 1548 of FdDNaselOlcorrect" 
frag join(10. .>729,<790. .1557) 

/note="l to 1548 of PASlOl.dna [Split]" 
frag 730.. 789 

/note="l to 60 of 101/1051inker" 
frag join(730. .>744/<745. .>768,<769. .>789) 

/note="l to 80 of 1021inker [Split]" 
BASE CODNT 344 A 471 C 433 G 309 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTAITA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAaGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
: 601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 

661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG CGGGCTGAAG 
781 ATCGCAGCCT TCAACATCCA GACATTTGGG GAGACCAAGA TGTCCAATGC GACCCTCGTC 
841 AGCTACATTG TGCAGATCCT GAGCCGCTAC GACATCGCCC TGGTCCAGGA GGTCAGAGAC 
901 AGCCACCTGA CTGCCGTGGG GAAGCTGCTG GACAACCTCA ATCAGGACGC ACCAGACACC 
961 TATCACXACG TGGTCAGTGA GCCACTGGGA CGGAACAGCT ATAAGGAGC6 CTACCTGTTC 
1021 GTGTACAGGC CTGACCAGGT GTCTGCGGTG GACAGCTACT ACTACGATGA TGGCTGCGAG 
1081 CCCTGCGG6A ACGACACCTT CAACCGAGAG CCAGCCATTG TCAGGTTCTT CTCCCGGTTC 
1141 ACAGAGGTCA GGGAGTTTGC CATTGTTCCC CTGCATGCGG CCCCGGGGGA CGCAGTAGCC 
1201 GAGATCGACG CTCTCTATGA CGTCTACCTG GATGTCCAAG AGAAATGGGG CTTGGAGGAC 
1261 GTCATGTTGA TGGGCGACTT CAATGCGGGC TGCAGCTATG TGAGACCCTC CCAGTGGTCA 
1321 TCCATCCGCC TGTGGl^CARG CCCCACCTTC CAGTGGCTGA TCCCCGACAfi CGCTGACACC 
1381 ACAGCTACAC CCACGCACTG TGCCTATGAC AGGATCGTGG TTGCAGGGAT GCTGCTCCGA 
1441 GGGGCCGTTG TTCCCGACTC GGCTCTTCCC TTTAACTTCC AGGCTGCCTA TGGCCTGAGT 
1501 GACCAACTGG CCCAAGCCAT CAGTGACCAC TATCCAGTGG AGGTGATGCT GAAGTGA 

// 
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.9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSCXILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVK KPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAG TGG ATA GAG 

VK VSCKASGYTFSAY- WIE 

171 180 189 198 207 ' 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGE I LP 

225 * 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GS NNSRYNEKFKGRVTVT 

279 288 297 306 315 . 324' 

AGA GAC ACA TCC ACA AAC ACA GCC TAC- ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSSLRSE 

333 342 351 360 ■ 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

D TAVYYCARSYDFAWFAY 

387 396 405 414 423 432 * 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

W G Q. G T L V T V S S A S T -K G P S 

V 

. 441 450 459 . 468 . 477 466 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKS TSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG A^C TCA 

GC LV K DY FPE PV TV. S WN S 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAV. LQSSG 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG" ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSL GTQ- 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKVDKK 

711 72X) 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA AtT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



V EPKSCDKTHTCPPCPAP 

765 774 783 792 801 810 

GAA GGC GGG CTG AAG ATC GCA GGC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



BGGLKIAAFNIQTFGETK 

819 828 837 846 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 



MSNATLVSYIVQILSRYD 

873 882 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 



lALVQEVRDS HL TAVG KL 

927 936 945 954 963 972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC ACT GAG 



LDNL KQDAPDTYHYV'V S E 

981 990 999 1008 1017 1026 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



PLGRNSYKERYLFVYRPD 

1035 1044 1053 1062 1071 1080 

CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC GAG CCC TGC GGG 



qvsavd'syyydd'gcepcg 

1089 10'98 1107 1116 1125 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 



NDTFNREPAIVRFFSRiFT 

1143 1152 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVREFAIVP LHAAPGDAV 



1197 1206 1215 1224 1233 1242 
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GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

AEIDALYDVYL DVQE KWG 

1251 1260 1269 1278 1287 X296 

TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 

LE D VM LMG DFNAGC S YV R 

1305 1314 1323 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 

•PSQWSSIRLWTSPTFQWL 

1359 1368 1377 1386 . 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

I PDSAD TTATPTHCAYDR 

1413 1422 1431 1440 1449 ' 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVP D SA L. 

1467 1476 1485 1494 1503 1512 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

PFNFQ AA.YG LSDQLAQA I 

1521 1530 1539 1548 

AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

SD H .YPVEV.ML. K* 
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PAS102 



PAS102.DNA 1566 bp JfiRNA PRI 
Hiamanised HMFGl Fab '2 fused to himan DNase I 



06-MAR-1995 
(pAS102) 



LOCUS 
DEFINITION 
ACCESSION 
NID 

ECEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) (See Figure 2) 
ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shakes. r Capon^D.J., Hellmiss^R.^ Marsters, S.A. and Baker, C.Ii, 
Recombinant human DNase I reduces the viscosity of cystic 



AUTHORS 
TITLE 
fibrosis 



JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



sputum 
Proc. Natl. 
91067672 
345 a 



Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
469 c 440 g 312 t 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACA66TGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA 6TAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTG6 GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACA2\AACTC ACAC ATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGGAGCGGC 
781 GGG CT6AAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGTGA 



Fig. 14(A) 
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LOCUS FDDNASE102 1566 BP SS-DNA SYN 23-MAR-2001 

DEFINITION - 
ACCESSION 
BCEYWORDS 
SOURCE 

BASE COUNT 345 A 468 C 440 G 313 T 0 OTHER 

.ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 

61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 

121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 

181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 

241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 

301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 

361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 

421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 

661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 

721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGGAGCGGC 

781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 

841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 

. 1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 

1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 

1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 

1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 

i 1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 

1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 

1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCGCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGTGA 

7/ 
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PAS302 

LOCaS FDDNASE302 1575 BP SS-DNA SYN 29-AOG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1575 

/note="l to 1566 of FdDNasel02correct" 
BASE COUNT 34 6 A 474 C 442 G 313 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCTIAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCT6 CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACT^GAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
781 GGGAGCGGCG GGCTGAAGAT CGCAGCCTTC AACATCCAGA CATTTGGGGA GACCAAGATG 
841 TCCAATGCCA CCCTCGTCAG CTACATTGTG CAGATCCTGA GCCGCTACGA CATCGCCCTG 
901 GTCCAGGAGG TCAGAGACAG CCACCTGACT GCCGTGGGGA AGCTGCTGGA CAACCTCAAT 
961 CAGGACGCAC CAGACACCTA TCACTACGTG GTCAGTGAGC CACTGGGACG GAACAGCTAT 
1021 AAGGAGCGCT ACCTGTTCGT GTACAGGCCT GACCAGGTGT CTGCGGTGGA CAGCTACTAC 
I 1081 TACGATGATG GCTGCGAGCC CTGCGGGAAC GACACCTTCA ACCGAGAGCC AGCCATTGTC 
' 1141 AGGTTCTTCT CCCGGTTCAC AGAGGTCAGG GAGTTTGCCA TTGTTCCCCT GCATGCGGCC 
1201 CCGGGGGACG CAGTAGCCGA GATCGACGCT CTCTATGACG TCTACCTGGA TGTCCAAGAG 
1261 AAATGGGGCT TGGAGGACGT CATGTTGATG GGCGACTTCA ATGCGGGCTG CAGCTATGTG 
1321 AGACCCTCCC AGTGGTCATC CATCCGCCTG TGGACAAGCC CCACCTTCCA GTGGCTGATC 
1381 CCCGACAGCG CTGACACCAC AGCTACACCC ACGCACTGTG CCTATGACAG GATCGTGGTT 
1441 GCAGGGATGC TGCTCCGAGG GGCCGTTGTT CCCGACTCGG CTCTTCCCTT TAACTTCCAG 
1501 GCTGCCTATG GCCTGAGTGA CCAACTGGCC CAAGCCATCA GTGACCACTA TCCAGTGGAG 
1561 GTGATGCTGA AGTGA 

// 
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9 18 27 . 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

MGWSCI ILFLVATATGVH 

63 .12 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG A^A AAG CCT GGG GCC TCA 

SQVQLVQSGAEVK-KPGA S 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKASGYTFSAYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEILP 

225 234 243 252 261 270 

' GGA AGT AAT AAT TCT AGR TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELSSLRS E 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSY. DFAWF AY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WGQGTLVTVSSA STKGPS 

441 450 459 468 - 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG .AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFP. LAPSSKSTSGGT AAI* 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSW'NS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 6B4 . 693 702 

•ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPS- NTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVEC PP 

765 774 78a 792 801 8X0 

TGC CCA GCA CCT GAA G6G AGC GGC GGG CT6 AAG ATC GCA GCC TTC AAC ATC CAG 



CPAPEGSGGLKIAAFNIQ 

819 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG 



TPGETKMSNATLVSYIVQ 

873 882 891 900 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG 



ILSRYDIALV^QEVRDSHL 

927 936 945 954 963 972 

ACT GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 



TAVGKLLDNLNQDAPDTY 

981 990* 999 1008 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 

HYVVSEPL'GRNSYKERYL 

1035 1044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 



FVYRPOQVSAVDSYYY DD 

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 



GCEP CGNDTFNREPAIVR 

1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



FF S R FTEV.R E FA I V P L H A 
1197 1206 1215 1224 1233 1242 
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GCC CCG GGG GAC GCA GTA. GCC GAG ATC GAC GCT CTC TAT GAC GTC TAG CTG GAT 

APGDAVAEIDALYDVYLD 

1251 1260 1269 1278 1287 1296 

- GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG 

VQEKWGLEDVMLMGDFNA 

1305 1314 1323 1332 1341 1350 ' 

GGC TGC AGO TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC ' 

GCSYVRPSQWS S I RLWTS 

1359 1368 1377 1386 1395 1404 

CCC ACC TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG 

PTFQWLIP DSADTTATPT 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT 

HCAYDRIVVAGMLLRGAV 

1467 1476 1485 1494 1503 1512 

GTT CCC GAC TCG GCT CTT" CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 

VP DSALPFNFQAA.YGLSD 

1521 1530 15'39 1548 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3' 

QLAQAISDHYPVE VMLK* 
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DAS103 

LOCOS PAS103.DNA 1560 bp loRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFGl Fab '2 fused to human DNase I (pAS103) 
ACCESSION 
NID 

KEYWORDS DNase I. j.^r- ^ /^r 

SOURCE DNase I sequence is from assembled oligos (thus modified c/i 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS ShakrS., Capon^D-J-r fiellmiss,R., Marsters, S. A. and Baker rC.L. 
Recombinant human DNase I reduces the viscosity of cystic 



TITLE 
fibrosis 



sputum 

Proc. Natl. Acad. Sci. U.S.A. B7 (23), 9188-9192 (1990) 



JOURNAL 

MEDLINE 91067672 
BASE COUNT 344 a 468 c 436 g 312 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG 6TAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAA2^GCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAAGACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC i^rarATfiOTG TGTGGAG TGC CCACCGTGCC CAGC ACGPGA AGGCGGGCTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGA6GGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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LOCUS FDDHASE103 1560 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 
ACCESSION 
KEYWORDS 
SOURCE 

FEATURES Location/Qualifiers 

frag joind- .>720,<793. .1560) 

/note«"l to 1560 of PAS103.dna f Split ] " 
frag 721.. 7 92 

/note-"l to 72 of 103/10 71inker" 
frag . join (721 . .>771, <772. . 792) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 344 A 467 C 436 G 313 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGC2VAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGGGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
j 721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCGGGCTG 

I 781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 

841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCA6GCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 

// 
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LOCOS FDDNASE103 1569 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOORCE 

FEATURES Location/Qualifiers 
frag 10.. 1569 

/note="l to 1560 of FdDNasel03correct" 
frag joindO. •>729,<802, .1569) 

/note^"! to 1560 of PAS103.dna [Split]" 
frag 730 -.801 

/note="l to 72 of 103/1071inker" 
frag join{730. •>780r<781. .801) 

/note«"l to 78 of 1021±nker [Split]" 
BASE COUNT 345 A 473 C '438 G 313 T 0 OTHER 

ORIGIN 

1 GCC6CCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GATIAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
I 781 GGCGGGCTGA AGATCGCAGC CTTCAACATC CAGACATTTG GGGAGACCAA GATGTCCAAT 

' 841 GCCACCCTCG TCAGCTACAT TGTGCAGATC CTGAGCCGCT ACGACATCGC CCTGGTCCAG 

901 GAGGTCAGAG ACAGCCACCT GACTGCCGTG GGGAAGCTGC TGGACAACCT CAATCAGGAC 
961 GCACCAGACA CCTATCACTA CGTGGTCAGT GAGCCACTGG GACGGAACAG CTATAAGGAG 
1021 CGCTACCTGT TCGTGTACAG GCCTGACCAG GTGTCTGCGG TGGACAGCTA CTACTACGAT 
1081 GATGGCTGCG AGCCCTGCGG GAACGACACC TTCAACCGAG AGCCAGCCAT TGTCAGGTTC 
1141 TTCTCCCGGT TCACAGAGGT CAGGGAGTTT GCCATTGTTC CCCTGCATGC GGCCCCGGGG 
1201 GACGCAGTAG CCGAGATCGA CGCTCTCTAT GACGTCTACC TGGATGTCCA AGAGAAATGG 
1261 GGCTTGGAGG ACGTCATGTT GATGGGCGAC TTC2\ATGCGG GCTGCAGCTA TGTGAGACCC 
1321 TCCCAGTGGT CATCCATCCG CCTGTGGACA AGCCCCACCT TCCAGTGGCT GATCCCCGAC 
1381 AGCGCTGACA CCACAGCTAC ACCCACGCAC TGTGCCTATG ACAGGATCGT GGTTGCAGGG 
1441 ATGCTGCTCC GAGGGGCCGT TGTTCCCGAC TCGGCTCTTC CCTTTAACTT CCAGGCTGCC 
1501 TATGGCCTGA GTGACCAACT GGCCCAAGCC ATCAGTGACC ACTATCCAGT GGAGGTGATG 
1561 CTGAAGTGA 

// 



Fig. 15(C) 



wo 01/74905 



PCT/GBOl/01324 



67/113 

9 18 27. 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG OTA GCA ACA GCT ACA GGT GTC CAC 

MGW. SCI ILFLVATATGVH 

63 12 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKKPGAS 

117 126 135 144 153 * ' 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKAS. GYTFSAYWI E 

171 180 189 198 207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGEILJ 

225 . 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 29? 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RD TSTNTAYMELSS LRS E 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

WG QGTL VTVSSA ST KG P S 

441 450 459 468 ' 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VF PLAPS SKSTS GG TAA L 

495 504 513 " 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVSWNS 

549 558 . 567 . 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVLQSSG 
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603 612 621 630 €39 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLS.SVVTVP SSSLGTQ 

657 666 675 ' 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAG AAG AAA 



TYICNVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVEC PP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 



CPAPEGGLKIAAFNIQTP 

819 828 837 846 855 864 

GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 



GETKMSNATLVSYIVQIL 

873 882 891 900 909 918 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 



SRYDI^ALVQEVRDSHLTA 

927 93*6 "945 954 9-63 972 

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 

VGKLLDNLNQDAPDTYHY 

981 990 33S 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 

VVSEPLGRN S YKERYLFV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT OAC GAG G5G TC^* GCC GTG .GAC AGC TAC ^C TAC GAT GAT GGC TGC 



YRPDQVSAVDSYYYDDGC 

1089 1098 1107 1116 1125 1134 

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC 



EPCGNDTFNREPAIVRFF 

1143 1152 1161 1170 1179 11B8 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG 



SR .F-TEVREFAIVPLHAAP 
1197 1206 1215 1224 1233 1242 
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GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAG GTC TAG CTG GAT GTC CAA 
GDAVAEIDALYDVYLDVQ 

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

EKWGLEDVMLMGDFNAGC 

1305 1314 1323 1332 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGO CCC ACC 

SYVRP'SQWSSIRLWTS PT 

1359- 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA* GCT ACA CCC ACG CAC TGT 

FQWLIP DSADTTATPTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT .GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 

AYDRIVVAGMLLRGAVVP 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSALPFNFQAAYGLSDQL 

1521 1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 * 

AQAISDHYPVEVMLK* 
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PAS104 



LOCUS PAS104.DNA 1560 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFGl Fab' 2 fused to human DNase I (pAS104) 

Position 924 G to A by ggg to gag 

Linker GR instead of GG (position 777) 

ACCESSION 

NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos {thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shakes., Capon,D-J.r Hellmiss^R., Marsters.S.A. and Baker,C,L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic 
fibrosis 

sputum 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 

MEDLIIJE 91067672 
BASE COUNT 346 a 468 C 434 g 312 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAA?^GCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
. 421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
J 481 GCCCTGGGCT GCCTGGTCAA- GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
6^1 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ArAH ATGCTG TGT6GAGTGC CCACCGTGCC CA GCACCTG A AGGCkGG CTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCGAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT G^GAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCGA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAfi 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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LOCOS FDDNASE104 1560 BP SS-DNA SYN 25-ADG-2000 

DEFINITION - 
ACCESSION - • 
KEYWORDS 
SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720,<793. .1560) 

/note="l to 1560 of PAS104.dna [Split]" 
frag 721.. 792 

/note="l to 72 of 1041inker'' 
frag join{721. •>774,<776. .792) 

/note="l to 72 of 1031inker [Split]" 
frag join(721. .>771,<772. •>774, <776. . 792) 

/note=«l to 78 of 1021inker [Split]" 
BASE COUNT 346 A 467 C 434 G 313 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGCAGGCTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGAGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 

1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 

1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGTGA 
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9 18 27 36 45 54 

5' ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 

M GWSCIILFLVATATGVH 

63 72 81 90 99 108 

TCC CAG GTG CAG CTG GTG CAG TCT GGG GCA. GAG GTG AAA AAG CCT GGG GCC TCA 

SQVQLVQSGAEVKKP GAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 

VKVSCKA SGYTF SAY WI E 

171 180 189 198 207 216 

TOG GTG CGC CAG OCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGKGLEWVGE ILP 

225 234 ' 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

G SNNSRYNEKFKGRV TVT 

279 288 297 J06 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTKTAYMELSSL RSE 

333 342 331 36D 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAW FAY 

3B7 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

~ W- ~G Q~ G T L V T V S S AST -K~ ■ G P S " 

441 450 459 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

•VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 ^ 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

GCLVKDYFPEPVTVS WNS 

549 558 567 576 585 5'94 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA ' 

GALTSGVH TFPAVLQSSG 
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.603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LY SLSSVVTVPSSS LG TQ 

657 666 675 684 693. 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG 



TY I C NVNH K P S NT K V D K K 

711 720 729 738 * 747 756 

GTT GAG CCC AAA TCT TGT GkC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 

VEPK SCDKT HTCCV EC PP 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC AGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 



CPAPEGRLKIAAFNIQTF 

819 828 837 846 855 864 

GG6 GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 



GETKMSNATLVSYIVQ IL 

873 882 891 900 909 91-8 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 



SRYDIALVQBVRDS HLTA 

927 936 945 954 963 972 

GTG GAG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 



VEKLLDNLNQDAPD TYH Y 

981 990 999 1008 1017 1026 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 



VVSEPLGRNSYKERYLFV 

1035 1044 1053 1062 1071 1080 

TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT GGC TGC 



YRPDQVSAVDSYYY DDGC 

1089 1098 1107 1116 . 1125 1134 

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC 



EPCG N DTF NRE PA I V R F F 

1143 1152 1161 1170 1179 1188 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GGC CCG 



SRFTEVREFAIVPL HAAP 
1197 1206 1215 1224 1233 1242 

Fig. 16(C) 
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GGG GAC GCA GTA GCC GAG ATC GAC GCT CTC TAT GAC GTC TAG CTG GAT GTC CAA 

GDAVAEIDALYDVY t*DVQ 

1251 12£0 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TOO 

EKWGLEDVM LMGDFNAGC 

1305 1314 1323 1332 1341 1350 

AGC TAT jC3TG AGA CCC TOO GAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVR, PSQWSS.IRLWTSPT 

1359 1368 1377 1386 1395 1404 

TTC GAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 

FQWLIPDSADTTAT PTHC 

1413 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 

AYDRI VVAGMIiIiRGAVVJ*. 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG 

DSAIi PFNFQA AYGLSDQT* 

1521 1530 1539 1548 1557 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG TGA 3 ' 

A. QAISDHYPVEVML K* 
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pASlOS 



LOCUS 
DEFINITION 
NLS(pAS105) 
ACCESSION 
NID 

KEYWORDS 
SOURCE 

MHDNASEl.dna) 
•ORGANISM Homo sapiens 



PASlOS. DNA 1578 bp mRNA PRI 06-MAR-1995 

Humanised HMFGl Fab '2 fused to human DNase I with SV40 



DNase I. 

DNase I sequence is from assembled oligos (thus modified c/f 



AUTHORS 
TITLE 
fibrosis 

JOURNAL 
MEDLINE 

BASE COUNT 

ORIGIN 



Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
Shak/S., Capon, D. J., Hellmiss,R., Marsters, S.A. and Baker^C.L. 
Recombinant human DNase I reduces the viscosity of cystic 

sputum 

Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 
91067672 

353 a 473 c 442 g 310 t 



1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAATVAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCaVAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTCG^ACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGA AG GCGGG CTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCACSVCAC CTATCACTAC" 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
1321 CTGTGGACAA GCCCCACCTT CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG 7W3GGGCCGTT 
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGRAGGGGGG CGGACCCAAA 
1561 AAGAAGCGCA AGGTTTGA ^^^^^^^^^^^^^L 
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LOCUS FDDNASE105 1578 BP SS-DNA SYN 25-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join(l. .>720r <781. .1578) 

/note="l to 1578 of PASlOS.dna [Split]" 
frag 721.. 780 

/note="l to 60 of 101/1051inker" 
frag join(721. .>735,<736. .>759,<760. .>780) 

/note="l to 80 of 1021inker [Split]" 
BASE COUNT 353 A 471 C 443 G 311 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGTCC ACCGTGTCCA GCACCAGAGG GCGGGCTGAA GATCGCAGCC 
781 TTCAACATCC AGACATTTGG GGAGACCAAG ATGTCCAATG CCACCCTCGT CAGCTACATT 
841 GTGCAGATCC TGAGCCGCTA CGACATCGCC CTGGTCCAGG AGGTCAGAGA CAGCCACCTG 
901 ACTGCCGTGG GGAAGCTGCT GGACAACCTC AATCAGGACG CACCAGACAC CTATCACTAC 
961 GTGGTCAGTG AGCCACTGGG ACGGAACAGC TATAAGGAGC GCTACCTGTT CGTGTACAGG 
I 1021 CCTGACCAGG TGTCTGCGGT GGACAGCTAC TACTACGATG ATGGCTGCGA GCCCTGCGGG 
1081 AACGACACCT TCAACCGAGA GCCAGCCATT GTCAGGTTCT TCTCCCGGTT CACAGAGGTC 
1141 AGGGAGTTTG CCATTGTTCC CCTGCATGCG GCCCCGGGGG ACGCAGTAGC CGAGATCGAC 
1201 GCTCTCTATG ACGTCTACCT GGATGTCCAA GAGAAATGGG GCTTGGAGGA CGTCATGTTG 
1261 ATGGGCGACT TCAATGCGGG CTGCAGCTAT GTGAGACCCT CCCAGTGGTC ATCCATCCGC 
. . 1321 CTGTGGACT^ GCCCCACCTT* CCAGTGGCTG ATCCCCGACA GCGCTGACAC CACAGCTACA 
1381 CCCACGCACT GTGCCTATGA CAGGATCGTG GTTGCAGGGA TGCTGCTCCG AGGGGCCGTT 
1441 GTTCCCGACT CGGCTCTTCC CTTTAACTTC CAGGCTGCCT ATGGCCTGAG TGACCAACTG 
1501 GCCCAAGCCA TCAGTGACCA CTATCCAGTG GAGGTGATGC TGAAGGGGGG CGGACCCAAA 
1561 AAGAAGCGCA AGGTTTGA 
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LOCUS FDDNASE105 1587 BP SS-DNA SYN 29-ADG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10., 1587 

/note="l to 1578 of FdDNaselOScorrect" 
frag join(10. . >729, <790. . 1587) 

/note="l to 1578 of PASlOS.dna [Split]" 
frag 730.. 789 

/note="l to 60 of 101/1051inker" 
frag join{730. .>744,<745. .>768,<769. .>789) 

/note="l to 80 of 1021inker [Split]" 
BASE COUNT 354 A 477 C 445 G 311 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGTCCA CCGTGTCCAG CACCAGAGGG CGGGCTGAAG 
781 ATCGCAGCCT TCAACATCCA GACATTTGGG GAGACCAAGA TGTCCAATGC CACCCTCGTC 
841 AGCTACATTG TGCAGATCCT GAGCCGCTAC GACATCGCCC TGGTCCAGGA GGTCAGAGAC 
901 AGCCACCTGA CTGCCGTGGG GAAGCTGCTG GACAACCTCA ATCAGGACGC ACCAGACACC 
961 TATCACTACG TGGTCAGTGA GCCACTGGGA CGGAACAGCT ATAAGGAGCG CTACCTGTTC 
- 1021 GTGTACAGGC CTGACCAGGT GTCTGCGGTG GACAGCTACT ACTACGATGA TGGCTGCGAG 
1081 CCCTGCGGGA ACGACACCTT CAACCGAGAG CCAGCCATTG TCAGGTTCTT CTCCCGGTTC 
1141 ACAGAGGTCA GGGAGTTTGC CATTGTTCCC CTGCATGCGG CCCCGGGGGA CGCAGTAGCC 
1201 GAGATCGACG CTCTCTATGA CGTCTACCTG GATGTCCAAG AGAAATGGGG CTTGGAGGAC 
1261 GTCATGTTGA TGGGCGACTT CAATGCGGGC TGCAGCTATG TGAGACCCTC CCAGTGGTCA 
1321 TCCATCCGCC TGTGGACAAG CCCCACCTTC CAGTGGCTGA TCCCCGACAG CGCTGACACC 
1381 ACAGCTACAC CCACGCACTG TGCCTATGAC AGGATCGTGG TTGCAGGGAT GCTGCTCCGA 
1441 GGGGCCGTTG TTCCCGACTC GGCTCTTCCC TTTAACTTCC AGGCTGCCTA TGGCCTGAGT 
1501 GACCAACTGG CCCAAGCCAT CAGTGACCAC TATCCAGTGG AGGTGATGCT GAAGGGGGGC 
1561 GGACCCAAAA AGAAGCGCAA GGTTTGA 
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9 18 27 36 45 54 

5 ' ATG GGA TGG AGO TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC . 

MGWSC IILFLVATATGVH 

63 72 ^ 81 90* 99 108 

TCC CAG GTG CAG CTG GTG CAG TCt' GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA - 

SQVQLVQSGAEVK KPGAS 

117 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAG ACC TTC AGT GCC TAG TGG ATA GAG 

V KVS C KA S.GYT F S A YW I E 

171 180 189 198 -207 216 

TGG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 

WVRQAPGK'GIiEWVGEILP 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 

GSNNSRYNEKFKGRVTVT 

279 288 297 306 315 324 

AGA GAG ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 

RDTSTNTAYMELS SLR. SE 

333 342 351 360 369 378 

GAG ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 

DTAVYYCARSYDFAWFAY 

387 396 405 414 423 432 

TCG GGC CAA GGG ACT CTG GTC ACA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 

" W~ G Q G t" t " V T' V S S A S T K G~ P S~ 

441 450 459 468 477 486' 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 

VFPLAPSSKSTSGGTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 

G CLVKDYPPEPVTVSWNS 

549 558 567 576 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC CTA CAG TCC TCA GGA 

GALTSGVHTFPAVL.QSSG 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 



LYSLSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TCC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYIC NVNHKPSNTKVDKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC CCA CCG TGC CCA GCA CCT 



VEPKSCD. KTHTC PPC PAP 

765 774 783 792 801 810 

GAA GGC GGG CTC AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT GGG GAG ACC AAG 



EGGLKIAAFNIQTFGETK 

819 828 837 8'46 855 864 

ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG AGC CGC TAC GAC 

MSNATL. VSYIVQILSRYD 

873 882 891 900 909 918 

ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC GTG GGG AAG CTG 

lALVQEVRDSHLTAVGK L 

321 "9316 945 954 963 -972 

CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC GTG GTC AGT GAG 



LDNLNQDAPDTYHYVVSE 

981 990 '993 1008 1017 1«2« 

CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG TAC AGG CCT GAC 



PDGRNSYKBRYLFVYRPD 

1035' * 1044 1053 1062 1071 1080 

CAG GTG TCT GCG GTG-<3AC AGC WC TAC TAC GAT GAT GGC TGC GAG -CCC TGC GGG 



QVSAVDSYYY DD GC E P C G 

1089 . 1098 1107 1116 1125 * 1134 

AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC TCC CGG TTC ACA 

NDTFNREP .AI VRFFS RFT 

1143 M'52 1161 1170 1179 1188 

GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG GGG GAC GCA GTA 



EVREFAIVPLHAAPGDAV 
1197 1206 1215 1224 1233 1242 
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GCC GAG ATC GAC GCT CTC TAT GAC GTC TAC CTG GAT GTC CAA GAG AAA TGG GGC 

A*E I D A L y D V Y L D V Q E K W G 

1251 1260 1269 121Q 1287 1296 

TTG GAG GAC GTC . ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC AGC TAT GTG AGA 

LEDVMLMGDFNAGC SYVR 

1305 1314 ' 1323 ' 1332 1341 1350 

CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC TTC CAG TGG CTG 

PS.QWS S I RLWTSPT F QWL 

1359 1368 1377 1386 1395 1404 

ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT GCC TAT GAC AGG 

IPD.SADTTATPTHCAYDR 

1413 1422 1431 1440 1449 1458 

ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC GAC TCG GCT CTT 

IVVAGMLLRGAVVPDSAIi 

1467 147 6 1485 1494 15 03 15 12 

CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC CAA CTG GCC CAA GCC ATC 

PFNFQAAYGLSDQLAQAl 

1521 1530 1539 1548 1557 1566 

AGT. GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA CCC AAA.-AAG AAG 

SDHYPVEVMLKGGGPKKK 

1575 

CGC AAtS GTT TGA 3' 
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PAS106 

LOCUS PAS106.DNA 1596 bp mRNA PRI 06-MAR-1995 

DEFINITION Humanised HMFGl Fab '2 fused to human DNase I with SV40 

NLS (pASlOG) 

ACCESSION 

NID 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. . 
AUTHORS Shak/S., Capon, D. J. ^ Hellmiss,R., Marsters, S.A. and Baker, C.L. 
TITLE Recombinant human* DNase I reduces the viscosity of cystic . 
fibrosis 

sputum 

JOURNAL Proc. Natl. Acad. Sci, U.S.A. 87 (23), 9188-9192 (1990) 

MEDLINE 91067672 
BASE COUNT 355 a 475 c 452 g 314 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACA66TGT CCACTCCC AG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAMCTC ACAC ATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGGAGCGGC 
781 GGGC TGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC^ 
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 
1081 GGCTGCGAGC CCTGCGGGAA CGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381- GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 A AGGGGGSCG GACCCftftAftA. GAAGCGCAAG 6TTT GA 
// ===== 



NLS 
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LOCUS FDDNASE106 1596 BP SS-DNA SYN 25--AOG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag joind- .>720,<799. . 1596) 

/note="l to 1596 of PAS106.dna [Split]" 
frag 721.. 798 

/note="l to 78 of 102/1061inker" 
BASE COUNT 355 A 474 C 452 G 315 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAA6TTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCTGA CCAGCGGCGT GGACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGTGT CCACCGTGTC CAGCACCAGA GGGGAGCGGC 
781 GGGCTGAAGA TCGCAGCCTT CAACATCCAG ACATTTGGGG AGACCAAGAT GTCCAATGCC 
841 ACCCTCGTCA GCTACATTGT GCAGATCCTG AGCCGCTACG ACATCGCCCT GGTCCAGGAG 
901 GTCAGAGACA GCCACCTGAC TGCCGTGGGG AAGCTGCTGG ACAACCTCAA TCAGGACGCA 
961 CCAGACACCT ATCACTACGT GGTCAGTGAG CCACTGGGAC GGAACAGCTA TAAGGAGCGC 
1021 TACCTGTTCG TGTACAGGCC TGACCAGGTG TCTGCGGTGG ACAGCTACTA CTACGATGAT 
i 1081 GGCTGCGAGC CCTGCGGGAA GGACACCTTC AACCGAGAGC CAGCCATTGT CAGGTTCTTC 
1141 TCCCGGTTCA CAGAGGTCAG GGAGTTTGCC ATTGTTCCCC TGCATGCGGC CCCGGGGGAC 
1201 GCAGTAGCCG AGATCGACGC TCTCTATGAC GTCTACCTGG ATGTCCAAGA GAAATGGGGC 
1261 TTGGAGGACG TCATGTTGAT GGGCGACTTC AATGCGGGCT GCAGCTATGT GAGACCCTCC 
1321 CAGTGGTCAT CCATCCGCCT GTGGACAAGC CCCACCTTCC AGTGGCTGAT CCCCGACAGC 
1381 GCTGACACCA CAGCTACACC CACGCACTGT GCCTATGACA GGATCGTGGT TGCAGGGATG 
1441 CTGCTCCGAG GGGCCGTTGT TCCCGACTCG GCTCTTCCCT TTAACTTCCA GGCTGCCTAT 
1501 GGCCTGAGTG ACCAACTGGC CCAAGCCATC AGTGACCACT ATCCAGTGGA GGTGATGCTG 
1561 AAGGGGGGCG GACCCAAAAA GAAGCGCAAG GTTTGA 

// 



Fig. 18(B) 
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LOGOS FDDNASE106 1605 BP SS-DNA SYN 29-ADG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1605 

/note="l to 1596 of FdDNaseloecorrect" 
frag joindO. .>729,<80B . - 1605) 

/note="l to 1596 of PAS106.dna [Split]" 
frag 730., 807 

/note-"l to 78 of 102/1061inker" 
BASE COUNT 356 A 480 C 454 G 315 T 0 OTHER 

ORIGIN 

1 GCCGCCACCA TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAG(3TGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACAGTCACTA GAGACACATC CACAAACACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG C6TGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
781 GGGAGCGGCG GGCTGAAGAT CGCAGCCTTC AACATCCAGA CATTTGGGGA GACCAAGATG 
841 TCCAATGCCA CCCTCGTCAG CTACATTGTG CAGATCCTGA GCCGCTACGA CATCGCCCTG 
901 GTCCAGGAGG TCAGAGACAG CCACCTGACT GCCGTGGGGA AGCTGCTGGA CAACCTCAAT 
961 CAGGACGCAC CAGACACCTA TCACTACGTG GTCAGTGAGC CACTGGGACG GAACAGCTAT 
1021 AAGGAGCGCT ACCTGTTCGT GTACAGGCCT GACCAGGTGT CTGCGGTGGA CAGCTACTAC 
1081 TACGATGATG GCTGCGAGCC CTGCGGGAAC GACACCTTCA ACCGAGAGCC AGCCATTGTC 
1141 AGGTTCTTCT CCCGGTTCAC AGAGGTCAGG GAGTTTGCCA TTGTTCCCCT GCATGCGGCC 
. 1201 CCGGGGGACG CAGTAGCCGA GATCGACGCT CTCTATGACG TCTACCTGGA TGTCCAAGAG 
1261 AAATGGGGCT TGGAGGACGT CATGTTGATG GGCGACTTCA ATGCGGGCTG CAGCTATGTG 
1321 AGACCCTCCC AGTGGTCATC CATCCGCCTG TGGACAAGCC CCACCTTCCA GTGGCTGATC 
1381 CCCGACAGCG CTGACACCAC AGCTACACCC ACGCACTGTG CCTATGACAG GATCGTGGTT 
1441 GCAGGGATGC TGCTCCGAGG GGCCGTTGTT CCCGACTCGG CTCTTCCCTT TAACTTCCAG 
1501 GCTGCCTATG GCCTGAGTGA CCAACTGGCC CAAGCCATCA GTGACCACTA TCCAGTGGAG 
1561 GTGATGCTGA AGGGGGGCGG ACCCAAAAAG AAGCGCAAGG TTTGA 

// 
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9 18 27 36 45 54 

ATG GGA TGG AGC TGT ATC ATC CTC TTC TTG GTA GCA ACA GCT ACA GGT GTC CAC 



MGWSCI ILPLV AT ATG V H 

63 72 81 90 99 108 

TCG CAG GTG CAG CTG GTG CAG TCT GGG GCA GAG GTG AAA AAG CCT GGG GCC TCA 



SQVQ. LVQSG'AEVKKPGAS 

117. 126 135 144 153 162 

GTG AAG GTG TCC TGC AAG GCT TCT GGC TAC ACC TTC AGT GCC TAC TGG ATA GAG 



VK VSC.KAS GY 'T F S A YW I E 

• 171 180 189 198 207 ' 216 

TCG GTG CGC CAG GCT CCA GGA AAG GGC CTC GAG TGG GTC GGA GAG ATT TTA CCT 



WV RQAP.GK GL E WV G E I L P 

225 234 243 252 261 270 

GGA AGT AAT AAT TCT AGA TAC AAT GAG AAG TTC AAG GGC CGA GTG ACA GTC ACT 



G.SNNSRYNEKF KGRVTVT 

279 288 297 306 315 324 

AGA GAC ACA TCC ACA AAC ACA GCC TAC ATG GAG CTC AGC AGC CTG AGG TCT GAG 



R DTSTNTAYMELS SLRSE 

333 342 351 360 369 378 

GAC ACA GCC GTC TAT TAC TGT GCA AGA TCC TAC GAC TTT GCC TGG TTT GCT TAC 



i DTAVY.YCARSYDFAWFAY 

f 

387 396 405 414 423 432 

TGG GGC CAA GGG ACT CTG GTC ^CA GTC TCC TCA GCC TCC ACC AAG GGC CCA TCG 



.G Q . 6. T L: V T V S .S A S. ,T K G P S 

441 450 459 * 468 477 486 

GTC TTC CCC CTG GCA CCC TCC TCC AAG AGC ACC TCT GGG GGC ACA GCG GCC CTG 



VF PLAPSSKS.T SG GTAAL 

495 504 513 522 531 540 

GGC TGC CTG GTC AAG GAC TAC TTC CCC GAA CCG GTG ACG GTG TCG TGG AAC TCA 



GCLV .KDYFPEPVTVSWNS 

549 558 567 576 * 585 594 

GGC GCC CTG ACC AGC GGC GTG CAC ACC TTC CCG GCT GTC' CTA CAG TCC TCA GGA 



GALTSGVHTFPAVLQS S-G 

Fig. 18(C) 
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603 612 621 630 639 648 

CTC TAC TCC CTC AGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYS LSSVVTVPSSSLGTQ 

657 666 675 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG GAC AAG AAA 



TYICNVNHKPSNTKV DKK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



V E P K S C D.K T H T C C V E.C P P 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGG AGC GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC -CAG 



CPAPEGS6GLKIAAFNIQ 

819 828 837 846 855 864 

ACA TTT GGG GAG ACC AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG 



TF6ETKMS NATLVSY IVQ 

873 882 891 900 909 918 

ATC CTG AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG 



ILSRYDIALVQEVRDSH L 

927 936 945 954 963 972 

ACT' GCC GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT 



TAVGKLLDWLNQDAP DTY 

981 990 999 1008 1017 1026 

CAC TAC GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG 



HYVVSEPLGRNS YKB RYL 

1035 1-044 1053 1062 1071 1080 

TTC GTG TAC AGG CCT GAC CAG GTG TCT GCG GTG GAC AGC TAC TAC TAC GAT GAT 

FVYRPDQVSAVDSY'YY DD 

1089 1098 1107 1116 1125 1134 

GGC TGC GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG 



GC E PCGNDT FN R E PA. I V R*. 

1143 1152 1161 1170 1179 1188 

TTC TTC TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG 



•FFS RFTEVREFA I VP LHA 
1197 1206 1215 1224 1233 1242 

Fig. 18(C) - 
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GCC CCG GGG GAC GCA GTA GCC GAG ATC GAC GOT CTC TAT GAC GTC TAG CTG GAT 

APdDA VA. EIDALYDVYL D 

1251 1260 1269 1278 1287 " 1296 

GTC CAA GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG 

VQEKWGLEDVMLMGDFNA 

1305 1314 1323 1332 1341 ' 1350 

GGC TGC AGC TAT GTG AGA CCC TCC CAG TGG. TCA TCC ATC CGC CTG TGG ACA AGC 

GCSYVRPSQW SSIRLWTS 

1359 1368 1377 1386 1395 1,404 

CCC ACC TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG 

PTF QWLIPDSADTTATPT 

1413 1422 1431 1440 1449 1458 

CAC TGT GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT 

HCAYDRIVVAGMLLRGAV 

1467 1476 1485 1494 1503 1S12 

GTT CCC GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC CTG AGT GAC 

VPDSALPFNFQAAYGLSD 

1521 1530 1539 1548 1557 1566 

CAA CTG GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG 

Q LA Q A I S D H Y P V E V M L K G 

1575 1584 1593 

GGC GGA CCC AAA AAG AAG CGC AAG GTT TGA 3' 

GGPKKKRKV* 



Fig. 18(C) 

(Sheet 3 of 3) 



wo 01/74905 



PCT/GBOl/01324 



87/113 



pASlO? 



LOCUS PAS107.DNA 1590 bp mRNA PRI 06-MAR-1995 

DEFINITION Hiamanised HMFGl Fab '2 fused to human DNase I with SV40 
NLS (pAS107) 
ACCESSION 

NID * ^ 

KEYWORDS DNase I. 

SOURCE DNase I sequence is from assembled oligos (thus modified c/f 

MHDNASEl.dna) 

ORGANISM Homo sapiens 

Eukaryotae; mitochondrial eukaryotes; Metazoa; Chorda ta; 
Vertebrata; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
AUTHORS Shakes. r Capon, D. J., Hellmiss,R., Marsters^S.A. and Baker, C.L. 
TITLE Recombinant human DNase I reduces the viscosity of cystic 
fibrosis 

sputum 

JOURNAL Proc. Natl. Acad. Sci. U.S.A. 87 (23), 9188-9192 (1990) 

MEDLINE 91067672 
BASE COUNT 354 a 474 c 448 g 314 t 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACA6 CTACAGGTGT CCACTCC CAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACA7y\CAC AGCCTACATG 
\ 301 GAGCTCAGCA .GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGT^CTCA 
541 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCXCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACAC ATGCTG TGTGGAGTGC CCACCGTGCC CAGCACCTGA AGGCGGG CTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGQAGGTGAT GCTGA AGGGG 
1561 GGCGGACCCA AAAAGAAGCG CAAGGTTTGA = 



I ►NLS 

Fig. 19(A) 



wo 01/74905 PCT/GBOl/01324 



88/113 



LOCUS FDDNASE107 1590 BP SS-DNA SYN 25-AUG-2000 

DEFIKITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 

frag join (1. •>720,<793. . 1590) 

/note="l to 1590 of PAS107.dna [Split]" 
frag 721.. 7 92 

/note="l to 72 of 103/1071inker" 
frag join (721. .>771,<772. ,792) 

/note="l to- 78 of 1021inker [Split]" 
BASE COUNT 354 A 473 C 448 G 315 T 0 OTHER 

ORIGIN 

1 ATGGGATGGA GCTGTATCAT CCTCTTCTTG GTAGCAACAG CTACAGGTGT CCACTCCCAG 
61 GTGCAGCTGG TGCAGTCTGG GGCAGAGGTG AAAAAGCCTG GGGCCTCAGT GAAGGTGTCC 
121 TGCAAGGCTT CTGGCTACAC CTTCAGTGCC TACTGGATAG AGTGGGTGCG CCAGGCTCCA 
181 GGAAAGGGCC TCGAGTGGGT CGGAGAGATT TTACCTGGAA GTAATAATTC TAGATACAAT 
241 GAGAAGTTCA AGGGCCGAGT GACAGTCACT AGAGACACAT CCACAAACAC AGCCTACATG 
301 GAGCTCAGCA GCCTGAGGTC TGAGGACACA GCCGTCTATT ACTGTGCAAG ATCCTACGAC 
361 TTTGCCTGGT TTGCTTACTG GGGCCAAGGG ACTCTGGTCA CAGTCTCCTC AGCCTCCACC 
421 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
481 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
541 GGCGCCCT.GA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
601 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
661 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGTTGAGCC CAAATCTTGT 
721 GACAAAACTC ACACATGCTG TGTCGAGT6T CCACCGTGTC CAGCACCAGA GGGCGGGCTG 
781 AAGATCGCAG CCTTCAACAT CCAGACATTT GGGGAGACCA AGATGTCCAA TGCCACCCTC 
841 GTCAGCTACA TTGTGCAGAT CCTGAGCCGC TACGACATCG CCCTGGTCCA GGAGGTCAGA 
901 GACAGCCACC TGACTGCCGT GGGGAAGCTG CTGGACAACC TCAATCAGGA CGCACCAGAC 
; 961 ACCTATCACT ACGTGGTCAG TGAGCCACTG GGACGGAACA GCTATAAGGA GCGCTACCTG 
1021 TTCGTGTACA GGCCTGACCA GGTGTCTGCG GTGGACAGCT ACTACTACGA TGATGGCTGC 
1081 GAGCCCTGCG GGAACGACAC CTTCAACCGA GAGCCAGCCA TTGTCAGGTT CTTCTCCCGG 
1141 TTCACAGAGG TCAGGGAGTT TGCCATTGTT CCCCTGCATG CGGCCCCGGG GGACGCAGTA 
1201 GCCGAGATCG ACGCTCTCTA TGACGTCTAC CTGGATGTCC AAGAGAAATG GGGCTTGGAG 
1261 GACGTCATGT TGATGGGCGA CTTCAATGCG GGCTGCAGCT ATGTGAGACC CTCCCAGTGG 
1321 TCATCCATCC GCCTGTGGAC AAGCCCCACC TTCCAGTGGC TGATCCCCGA CAGCGCTGAC 
1381 ACCACAGCTA CACCCACGCA CTGTGCCTAT GACAGGATCG TGGTTGCAGG GATGCTGCTC 
1441 CGAGGGGCCG TTGTTCCCGA CTCGGCTCTT CCCTTTAACT TCCAGGCTGC CTATGGCCTG 
1501 AGTGACCAAC TGGCCCAAGC CATCAGTGAC CACTATCCAG TGGAGGTGAT GCTGAAGGGG 
1561 GGCGGACCCA AAAAGAAGCG CAAGGTTTGA 

// 



Fig. 19(B) 
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LOCUS FDDNASE107 1599 BP SS-DNA SYN 29-AUG-2000 

DEFINITION - 

ACCESSION 

KEYWORDS 

SOURCE 

FEATURES Location/Qualifiers 
frag 10.. 1599 

/note="l to 1590 of FdDNaselO 7 correct" 
frag join(10. .>729,<802. .1599) 

/note="l to 1590 of PAS107.cina [Split]" 
frag 730,. 801 

/note="l to 72 of 103/1071inker" 
frag join(730. .>780,<781. .801) 

/note="l to 78 of 1021inker [Split]" 
BASE COUNT 355 A 479 C 450 G 315 T 0 OTHER 

ORIGIK 

1 GCCGCCACC A TGGGATGGAG CTGTATCATC CTCTTCTTGG TAGCAACAGC TACAGGTGTC 
61 CACTCCCAGG TGCAGCTGGT GCAGTCTGGG GCAGAGGTGA AAAAGCCTGG GGCCTCAGTG 
121 AAGGTGTCCT GCAAGGCTTC TGGCTACACC TTCAGTGCCT ACTGGATAGA GTGGGTGCGC 
181 CAGGCTCCAG GAAAGGGCCT CGAGTGGGTC GGAGAGATTT TACCTGGAAG TAATAATTCT 
241 AGATACAATG AGAAGTTCAA GGGCCGAGTG ACA6TCACTA GAGACACATC CACATVAGACA 
301 GCCTACATGG AGCTCAGCAG CCTGAGGTCT GAGGACACAG CCGTCTATTA CTGTGCAAGA 
361 TCCTACGACT TTGCCTGGTT TGCTTACTGG GGCCAAGGGA CTCTGGTCAC AGTCTCCTCA 
421 GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
481 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
541 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 
601 GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
661 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
721 AAATCTTGTG ACAAAACTCA CACATGCTGT GTCGAGTGTC CACCGTGTCC AGCACCAGAG 
\ 781 GGCGGGCTGA AGATCGCAGC CTTCAACATC CAGACATTTG GGGAGACCAA GATGTCCAAT 
) 841 GCCACCCTCG TCAGCTACAT TGTGCAGATC CTGAGCCGCT ACGACATCGC CCTGGTCCAG 
901 GAGGTCAGAG ACAGCCACCT GACTGCCGTG GGGAAGCTGC TGGACAACCT CAATCAGGAC 
961 GCACCAGACA CCTATCACTA CGTGGTCAGT GAGCCACTGG GACGGAACAG CTATAAGGAG 
1021 CGCTACCTGT TCGTGTACAG GCCTGACCAG GTGTCTGCGG TGGACAGCTA CTACTACGAT 
1081 GATGGCTGCG AGCCCTGCGG GAACGACACC TTCAACCGAG AGCCAGCCAT TGTCAGGTTC 
1141 TTCTCCCGGT TCACAGAGGT CAGGGAGTTT GCCATTGTTC CCCTGCATGC GGCCCCGGGG-. 
1201 GACGCAGTAG CCGAGATCGA CGCTCTCTAT GACGTCTACC TGGATGTCCA AGAGAAATGG 
1261 GGCTTGGAGG ACGTCATGTT GATGGGCGAC TTCAATGCGG GCTGCAGCTA TGTGAGACCC 
1321 TCCCAGTGGT CATCCATCCG CCTGTGGACA AGCCCCACCT TCCAGTGGCT GATCCCCGAC 
1381 AGCGCTGACA CCACAGCTAC ACCCACGCAC TGTGCCTATG ACAGGATCGT GGTTGCAGGG 
1441 ATGCTGCTCC GAGGGGCCGT TGTTCCCGAC TCGGCTCTTC CCTTTAACTT CCAGGCTGCC 
1501 TATGGCCTGA GTGACCAACT GGCCCAAGCC ATCAGTGACC ACTATCCAGT GGAGGTGATG 
1561 CTGAAGGGGG GCGGACCCAA AAAGAAGCGC AAGGTTTGA 

// 



Fig. 19(C) 
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603 612 621 630 639 645 

CTC TAC TCC CTC hGC AGC GTG GTG ACC GTG CCC TCC AGC AGC TTG GGC ACC CAG 

LYSLSSVVTVPSSSLG TQ 

657 666 675. 684 693 702 

ACC TAC ATC TGC AAC GTG AAT CAC AAG CCC AGC AAC ACC AAG GTG* GAC AAG AAA 



TYICNV .NHKPSNTKVD KK 

711 720 729 738 747 756 

GTT GAG CCC AAA TCT TGT GAC AAA ACT CAC ACA TGC TGT GTG GAG TGC CCA CCG 



VEPKSCDKTHTCCVEC P. P 

765 774 783 792 801 810 

TGC CCA GCA CCT GAA GGC GGG CTG AAG ATC GCA GCC TTC AAC ATC CAG ACA TTT 

CPAPEGGLKIAAFNIQTF 

819 828 837 846 855 864 

GGG GAG ACX: AAG ATG TCC AAT GCC ACC CTC GTC AGC TAC ATT GTG CAG ATC CTG 



GETKMSMATLVSYIVQ IL 

873 882 891 900 909 918 

AGC CGC TAC GAC ATC GCC CTG GTC CAG GAG GTC AGA GAC AGC CAC CTG ACT GCC 



SRYDIALVQ EVRDSHL TA 

927 936 945 954 963 972 

GTG GGG AAG CTG CTG GAC AAC CTC AAT CAG GAC GCA CCA GAC ACC TAT CAC TAC 



VGKLLDNLNQDAPDTYHY 

981 990 999 1008 1017 10.26 

GTG GTC AGT GAG CCA CTG GGA CGG AAC AGC TAT AAG GAG CGC TAC CTG TTC GTG 

VVSEPLG. RNSYKERYL PV 

1035 1044 1053 1062 1071 1080 

tac agg cct gac cag gtg tct gcg gtg gac agc tac tac tac gat gat ggc tgc 
yrpdqvsav'dsyyyddgc 

1089 1098 1107 1116 . 1125 1134 

GAG CCC TGC GGG AAC GAC ACC TTC AAC CGA GAG CCA GCC ATT GTC AGG TTC TTC 



EPCGN*DTFNREPAIVRFF 

1143 1152 1161 1170 1179 1188 

TCC CGG TTC ACA GAG GTC AGG GAG TTT GCC ATT GTT CCC CTG CAT GCG GCC CCG 



SRFTEVREFAIVPL H- AAP 
1197 1206 1215 1224 1233 1242 

Fig. 19(D) 
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GGG GAC GCA GTA C3CC GAG ATC GAC GCT CTC TAT GAC GTC TAG CTG GAT GTC CAA 
GDAVAEIDALYDVYLDVQ 

1251 1260 1269 1278 1287 1296 

GAG AAA TGG GGC TTG GAG GAC GTC ATG TTG ATG GGC GAC TTC AAT GCG GGC TGC 

EKWGLEDVMLMGDFNAGC 

1305 1314 1323 1332 . 1341 1350 

AGC TAT GTG AGA CCC TCC CAG TGG TCA TCC ATC CGC CTG TGG ACA AGC CCC ACC 

SYVRPSQWSSIRLWTSPT 

1359 1368 1377 1386 1395 1404 

TTC CAG TGG CTG ATC CCC GAC AGC GCT GAC ACC ACA GCT ACA CCC ACG CAC TGT 

FQWLIPDSADTTATPTHC 

1413 • 1422 1431 1440 1449 1458 

GCC TAT GAC AGG ATC GTG GTT GCA GGG ATG CTG CTC CGA GGG GCC GTT GTT CCC 

AYDRIVVAGMLLRGAVVP 

1467 1476 1485 1494 1503 1512 

GAC TCG GCT CTT CCC TTT AAC TTC CAG GCT GCC TAT GGC GTG AGT GAC CAA CTG 

DSALPFNF QAAYGLSDQL 

1521 1530 1539 1548 ' 1557 XS^-S 

GCC CAA GCC ATC AGT GAC CAC TAT CCA GTG GAG GTG ATG CTG AAG GGG GGC GGA 

AQA ISD HYPVEVML K-GGG 

1575 1584 
CCC AAA AAG AAG CGC AAG GTT TGA 3* 

PKKKRKV* 
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Mammalian expression of humanised HiyiFG1-D Nase constructs 




hCMV 




Fig. 20 
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Fab-DNose. 




hDNasel 



Fig. 28(B) 
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Fig. 29 
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Fig. 31(A) 
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Fig. 33(A) 
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Bovine DNase I standard curves at various time points 
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